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Preface 


This  book,  based  on  courses  given  to  physics  and  applied  mathematics  stu- 
dents at  Imperial  College,  deals  with  the  mechanics  of  particles  and  rigid 
bodies.  It  is  designed  for  students  with  some  previous  acquaintance  with 
the  elementary  concepts  of  mechanics,  but  the  book  starts  from  first  princi- 
ples, and  little  detailed  knowledge  is  assumed.  An  essential  prerequisite  is  a 
reasonable  familiarity  with  differential  and  integral  calculus,  including  par- 
tial differentiation.  However,  no  prior  knowledge  of  differential  equations 
should  be  needed.  Vectors  are  used  from  the  start  in  the  text  itself;  the 
necessary  definitions  and  mathematical  results  are  collected  in  an  appendix. 

Classical  mechanics  is  a  very  old  subject.  Its  basic  principles  have  been 
known  since  the  time  of  Newton,  when  they  were  formulated  in  the  Prin- 
cipia,  and  the  mathematical  structure  reached  its  mature  form  with  the 
works  of  Lagrange  in  the  late  eighteenth  century  and  Hamilton  in  the  nine- 
teenth. Remarkably  enough,  within  the  last  few  decades  the  subject  has 
once  again  become  the  focus  of  very  active  fundamental  research.  Some 
of  the  most  modern  mathematical  tools  have  been  brought  to  bear  on  the 
problem  of  understanding  the  qualitative  features  of  dynamics,  in  particu- 
lar the  transition  between  regular  and  turbulent  or  chaotic  behaviour.  The 
fourth  edition  of  the  book  was  extended  to  include  new  chapters  providing 
a  brief  introduction  to  this  exciting  work.  In  this  fifth  edition,  the  material 
is  somewhat  expanded,  in  particular  to  contrast  continuous  and  discrete  be- 
haviours. We  have  also  taken  the  opportunity  to  revise  the  earlier  chapters, 
giving  more  emphasis  to  specific  examples  worked  out  in  more  detail. 

Many  of  the  most  fascinating  recent  discoveries  about  the  nature  of  the 
Earth  and  its  surroundings  —  particularly  since  the  launching  of  artificial 
satellites  —  are  direct  applications  of  classical  mechanics.  Several  of  these 
are  discussed  in  the  following  chapters,  or  in  problems. 


vii 


viii 


Classical  Mechanics 


For  physicists,  however,  the  real  importance  of  classical  mechanics  lies 
not  so  much  in  the  vast  range  of  its  applications  as  in  its  role  as  the  base  on 
which  the  whole  pyramid  of  modern  physics  has  been  erected.  This  book, 
therefore,  emphasizes  those  aspects  of  the  subject  which  are  of  importance 
in  quantum  mechanics  and  relativity  —  particularly  the  conservation  laws, 
which  in  one  form  or  another  play  a  central  role  in  all  physical  theories. 

For  applied  mathematicians,  the  methods  of  classical  mechanics  have 
evolved  into  a  much  broader  theory  of  dynamical  systems  with  many  ap- 
plications well  outside  of  physics,  for  example  to  biological  systems. 

The  first  five  chapters  are  primarily  concerned  with  the  mechanics  of  a 
single  particle,  and  Chapter  6,  which  could  be  omitted  without  substantially 
affecting  the  remaining  chapters,  deals  with  potential  theory.  Systems  of 
particles  are  discussed  in  Chapters  7  and  8,  and  rigid  bodies  in  Chapter  9. 
The  powerful  methods  of  Lagrange  are  introduced  at  an  early  stage,  and  in 
simple  contexts,  and  developed  more  fully  in  Chapters  10  and  11.  Chapter 
12  contains  a  discussion  of  Hamiltonian  mechanics,  emphasizing  the  rela- 
tionship between  symmetries  and  conservation  laws  —  a  subject  directly 
relevant  to  the  most  modern  developments  in  physics.  It  also  provides  the 
basis  for  the  later  treatment  of  order  and  chaos  in  Hamiltonian  systems  in 
Chapter  14.  This  follows  the  introduction  to  the  geometrical  description  of 
continuous  dynamical  systems  in  Chapter  13,  which  includes  a  discussion 
of  various  non-mechanics  applications.  Appendices  from  the  fourth  edition 
on  (A)  Vectors,  (B)  Conies,  (C)  Phase-plane  analysis  near  critical  points 
are  supplemented  here  by  a  new  appendix  (D)  Discrete  dynamical  systems 
—  maps. 

The  writing  of  the  first  edition  of  this  book  owed  much  to  the  advice  and 
encouragement  of  the  late  Professor  P.T.  Matthews.  Several  readers  have 
also  helped  by  pointing  out  errors,  particularly  in  the  answers  to  problems. 

We  are  grateful  to  the  following  for  permission  to  reproduce  copyright 
material:  Springer- Verlag  and  Professor  Oscar  E.  Lanford  III  for  Fig.  13.20; 
Cambridge  University  Press  for  Fig.  13.22  and,  together  with  Professors 
G.L.  Baker  and  J. P.  Gollub,  for  Fig.  D.5;  Institute  of  Physics  Publishing 
Limited  and  Professor  M.V.  Berry  for  Figs.  14.11,  14.12  and  14.13:  the 
Royal  Society  and  Professors  W.P.  Reinhardt  and  I.  Dana  for  the  figure  in 
the  answer  to  Appendix  D,  Problem  13. 

T.W.B.  Kibble 
F.H.  Berkshire 
Imperial  College  London 
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Useful  Constants  and  Units 


Physical  constants 


speed  of  light 

c 

— 

2.99792458  x  10sms 

gravitational  constant 

G 

= 

6.673  x  10"11  N  m2  kg" 

Planck  s  constant/27r 

h 

= 

1.054  57  x  10~34  Js 

mass  of  hydrogen  atom 

mH 

= 

1.673  52  x  10    '  kg 

mass  of  electron 

me 

= 

9.109  38  x  10"31  kg 

charge  of  proton 

e 

— 

1.602  18  x  10  ly  C 

permittivity  of  vacuum 

eo 

8.85419  x  10  12  Fm  1 

permeability  of  vacuum 

fJ-o 

4tt  x  10-7Hm-1 

Defined  standard  values 

standard  gravitational  acceleration 

9n 

9.806  65  ms"2 

normal  atmospheric  pressure 

1  atm 

1.013  25  x  105Pa 

1.013  25  bar 

Properties  of  Earth 

mass 

M 

5.974  x  1024kg 

GM 

3.9860  x  1014m3s"2 

radius  (polar) 

Rp 

6356.8  km 

(equatorial) 

Re 

6378.1km 

(mean)[=  (R2eRP)^} 

R 

6371.0  km 

semi-major  axis  of  orbit 

a 

1.495  98  x  108km 

eccentricity  of  orbit 

e 

0.016  722 

orbital  period  (sidereal  year) 

r 

3.155  75  x  107s 

mean  orbital  velocity 

V 

29.785  kms"1 

surface  escape  velocity  (mean) 

Ve 

ll.lSkms"1 

rotational  angular  velocity 

w 

7.2921  x  lO^s"1 
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Properties  of  Sun  and  Moon 

mass  of  Sun  Ms  =  1.989  x  1030  kg 

=  3.329  5  x  105M 
GMS  =  1.32  7  1  2  x  1020m 

mass  of  Moon  MM  =  7.348  x  1022  kg 

=  0.012  300  M 

semi-major  axis  of  lunar  orbit  ayi  =  3.8440  x  105  km 

lunar  orbital  period  (sidereal  month)        tm  =  2.3606  x  106s 


SI  units 

kilogramme  (kg):  mass  of  the  international  standard  kilogramme  kept  at 
Sevres  in  France. 

second  (s):  9192  631770  oscillation  periods  of  the  hyperfine  transition 
between  the  levels  F  =  4,  tuf  =  0  and  F  =  3,mjr  =  0  in  the  ground 
state  of  133  Cs. 

metre  (m):  distance  travelled  by  light  in  vacuum  in  (1/299  792  458)  s. 

ampere  (A):  defined  so  that  the  force  per  unit  length  between  two  infinitely 
long  parallel  wires  of  negligible  cross  section  1  m  apart  in  vacuum,  each 
carrying  a  current  of  1  A  is  2  x  10-7Nm_1  (or,  equivalently,  so  that 
the  constant  fiQ  has  the  precise  value  Air  x  10_7NA-2). 

(The  kelvin,  candela  and  mole  are  not  used  in  this  book.) 


Subsidiary  units 

newton  IN  =  1  kg  m  s 
joule  1J  =  1 N  m 

watt  lW=Us-1 
coulomb      1  C  =  1  A  s 


pascal  1  Pa  =  1 N  m 

ton(ne)  It  =  103  kg 

bar  1  bar  =  105  Pa 

hertz  1  Hz  =  1  s_1 


British  and  American  units 

foot  1ft  =  0.3048  m  pound        1  lb  =  0.45259237  kg 


Prefixes  denoting  multiples  and  submultiples 


103 

kilo  (k) 

10" 

-3 

milli  (m) 

106 

mega  (M) 

10" 

-6 

micro  (/i) 

109 

giga  (G) 

10" 

-9 

nano  (n) 

1012 

tera  (T) 

10" 

-12 

pico  (p) 

1015 

peta  (P) 

10" 

-15 

femto  (f) 

1018 

exa  (E) 

10" 

-  18 

atto  (a) 

List  of  Symbols 


The  following  list  is  not  intended  to  be  exhaustive.  It  includes  symbols  of 
frequent  occurrence  or  special  importance.  The  figures  refer  to  the  section 
in  which  the  symbol  is  defined. 


A,  Aa 

complex  amplitude 

2.3,  11.3 

A 

vector  potential 

A.7 

a,  a 

acceleration 

1.2 

a 

amplitude  of  oscillation 

2.2 

a 

semi-major  axis  of  orbit 

4.4,  B.l 

a 

equatorial  radius 

6.5 

B 

magnetic  field 

5.2,  A.7 

b 

semi-minor  axis  of  orbit 

4.4,  B.l 

b 

impact  parameter 

4.3,  4.5 

c 

polar  radius 

6.5 

c 

propagation  velocity 

10.6 

c 

control  parameters 

13.1 

d 

dipole  moment 

6.2,  6.4 

E 

total  energy 

2.1,  3.1,  8.5 

E 

electric  field 

6.1,  A.7 

e 

base  of  natural  logarithms 

e 

(minus)  electron  charge 

4.7 

e 

eccentricity  of  orbit 

4.4,  B.2 

ei,e2,e3 

principal  axes 

9.4,  A.10 

F  Fi,  Fij 

force 

1.2,  8.1 

Fa 

generalized  force 

3.7,  10.2 

F 

phase-space  velocity 

13.1 

Ft 

constants  of  the  motion 

14.1 

f 

particle  flux 

4.5 
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G 

gravitational  constant 

1.2 

G 

moment  of  force 

3.3 

G 

generator  of  transformation 

12.7 

9,9 

gravitational  acceleration 

5.3,  6.1 

q* 

observed  gravitational  acceleration 

5.3 

H 

Hamiltonian  function 

12.1 

I 

action  integral 

3.7 

moment  of  inertia 

9.2,  9.3 

Ixy 

product  of  inertia 

9.3 

principal  moment  of  inertia 

9.4,  9.5 

h 

action  variables 

14.3 

i 

i 

particle  index 

1.2,  8.1 

i,3,k 

unit  vectors 

A.l,  1.1 

J,  J* 

angular  momentum 

3.3,  7.1,  8.3 

oscillator  constant 

2.2,  11.2 

k 

inverse  square  law  constant 

4.3 

L 

Lagrangian  function 

3.7,  10.2 

I 

semi-latus  rectum  of  orbit 

4.3,  4.4,  B.2 

In 

natural  logarithm 

M 

total  mass 

7.1,  8.1 

M 

Jacobian  matrix 

14.1 

M 

invariant  torus 

14.1 

m,  rrii 

mass 

1.2,  6.1 

N 

number  of  particles 

1.2,  8.1 

n 

number  of  degrees  of  freedom 

10.1 

P 

total  momentum 

7.1,  8.1 

P,Pt,P* 

momentum 

1.2,  7.2 

Pi,  Pa 

generalized  momentum 

3.7,  12.1 

P 

exponential  rate  coefficient 

2.2 

Q 

quality  factor 

2.5 

Q 

quadrupole  moment 

6.2,  6.4 

1,  Qi 

electric  charge 

1.2,  6.1 

It 

curvilinear  co-ordinate 

3.7,  A.8 

qa 

generalized  co-ordinate 

10.1 

R 

radius  of  Earth 

4.3 

R 

position  of  centre  of  mass 

7.1,  8.1 

r 

radial  co-ordinate 

3.5 

List  of  Symbols  xix 


r,ri 

position  vector 

1.1,  A.l 

relative  position 

1.2,  7.1 

r* 

position  in  CM  frame 

7.2,  8.3 

T,  T* 

kinetic  energy 

2.1,  3.1,  7.2,  8.5 

T,Te 

twist  map 

D.3 

t 

time 

1.1 

U 

effective  potential  energy  function 

4.2,  12.4 

V 

potential  energy 

2.1,  3.1,  8.5,  13.3 

V,Vi 

velocity 

1.2 

dW,SW 

work  done  in  small  displacement 

2.4,  10.2 

X,Y,Z 

co-ordinates  of  centre  of  mass  R 

10.1 

X,Y 

fixed  points  of  maps 

D.l 

x,y,z 

Cartesian  co-ordinates 

A.l,  1.1 

x,  x% ,  x,  y 

phase-space  co-ordinates 

13.1,  13.2 

iterates  of  maps 

D.l 

a 

rotation  number 

D.3 

7 

damping  constant 

2.5 

5 

small  variation 

3.6 

6 

Feigenbaum  number 

D.l 

e 

oblateness 

6.5 

eo 
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Chapter  1 

Introduction 


Classical  mechanics  is  one  of  the  most  familiar  of  scientific  theories.  Its  basic 
concepts  —  mass,  acceleration,  force,  and  so  on  —  have  become  very  much 
a  part  of  our  everyday  modes  of  thought.  So  we  may  easily  regard  their 
physical  meaning  as  more  obvious  than  it  really  is.  For  this  reason,  a  large 
part  of  this  introductory  chapter  will  be  devoted  to  a  critical  examination 
of  the  fundamental  concepts  and  principles  of  mechanics. 

Every  scientific  theory  starts  from  a  set  of  hypotheses,  which  are  sug- 
gested by  our  observations,  but  represent  an  idealization  of  them.  The 
theory  is  then  tested  by  checking  the  predictions  deduced  from  these  hy- 
potheses against  experiment.  When  persistent  discrepancies  are  found,  we 
try  to  modify  the  hypotheses  to  restore  the  agreement  with  observation.  If 
many  such  tests  are  made  and  no  serious  disagreement  emerges,  then  the 
hypotheses  gradually  acquire  the  status  of  'laws  of  nature'.  When  results 
that  apparently  contradict  well-established  laws  appear,  as  they  often  do, 
we  tend  to  look  for  other  possible  explanations  —  for  simplifying  assump- 
tions we  have  made  that  may  be  wrong,  or  neglected  effects  that  may  be 
significant. 

It  must  be  remembered  however  that,  no  matter  how  impressive  the 
evidence  may  be,  we  can  never  claim  for  these  laws  a  universal  validity.  We 
may  only  be  confident  that  they  provide  a  good  description  of  that  class  of 
phenomena  for  which  their  predictions  have  been  adequately  tested.  One 
of  the  earliest  examples  is  provided  by  Euclid's  axioms.  On  any  ordinary 
scale,  they  are  unquestionably  valid,  but  we  are  not  entitled  to  assume  that 
they  should  necessarily  apply  on  either  a  cosmological  or  a  sub-microscopic 
scale.  Indeed,  they  have  been  modified  in  Einstein's  theory  of  gravitation 
('general  relativity'). 

The  laws  of  classical  mechanics  are  no  exception.  Since  they  were  first 
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formulated  by  Galileo  and  by  Newton  in  his  Principia,  their  range  of  known 
validity  has  been  enormously  extended,  but  in  two  directions  they  have  been 
found  to  be  inadequate.  For  the  description  of  the  small-scale  phenomena 
of  atomic  and  nuclear  physics,  classical  mechanics  has  been  superseded  by 
quantum  mechanics,  and  for  phenomena  involving  speeds  approaching  that 
of  light,  by  relativity. 

This  is  not  to  say  that  classical  mechanics  has  lost  its  value.  Indeed 
both  quantum  mechanics  and  the  special  and  general  theories  of  relativity 
are  extensions  of  classical  mechanics  in  the  sense  that  they  reproduce  its 
results  in  appropriate  limiting  cases.  Thus  the  fact  that  these  theories  have 
been  confirmed  actually  reinforces  our  belief  in  the  correctness  of  classical 
mechanics  within  its  own  vast  range  of  validity.  Indeed,  it  is  a  remarkably 
successful  theory,  which  provides  a  coherent  and  satisfying  account  of  phe- 
nomena as  diverse  as  the  planetary  orbits,  the  tides  and  the  motion  of  a 
gyroscope.  Moreover,  even  outside  this  range,  many  of  the  results  of  clas- 
sical mechanics  still  apply.  In  particular,  the  conservation  laws  of  energy, 
momentum  and  angular  momentum  are,  so  far  as  we  yet  know,  of  universal 
validity. 

1.1    Space  and  Time 

The  most  fundamental  assumptions  of  physics  are  probably  those  concerned 
with  the  concepts  of  space  and  time.  We  assume  that  space  and  time  are 
continuous,  that  it  is  meaningful  to  say  that  an  event  occurred  at  a  specific 
point  in  space  and  a  specific  instant  of  time,  and  that  there  are  univer- 
sal standards  of  length  and  time  (in  the  sense  that  observers  in  different 
places  and  at  different  times  can  make  meaningful  comparisons  of  their 
measurements).  These  assumptions  are  common  to  the  whole  of  physics, 
and,  though  all  are  being  challenged,  there  is  as  yet  no  compelling  evidence 
that  we  have  reached  the  limits  of  their  range  of  validity. 

In  'classical'  physics,  we  assume  further  that  there  is  a  universal  time 
scale  (in  the  sense  that  two  observers  who  have  synchronized  their  clocks 
will  always  agree  about  the  time  of  any  event),  that  the  geometry  of  space  is 
Euclidean,  and  that  there  is  no  limit  in  principle  to  the  accuracy  with  which 
we  can  measure  all  positions  and  velocities.  These  assumptions  have  been 
somewhat  modified  in  quantum  mechanics  and  relativity.  Here,  however, 
we  shall  take  them  for  granted,  and  concentrate  our  attention  on  the  more 
specific  assumptions  of  classical  mechanics. 
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The  relativity  principle 

In  Aristotle's  conception  of  the  universe,  the  fact  that  heavy  bodies  fall 
downwards  was  explained  by  supposing  that  each  element  (earth,  air,  fire, 
water)  has  its  own  appointed  sphere,  to  which  it  tends  to  return  unless 
forcibly  prevented  from  so  doing.  The  element  earth,  in  particular,  tends 
to  get  as  close  as  it  can  to  the  centre  of  the  Universe,  and  therefore  forms 
a  sphere  about  this  point.  In  this  kind  of  explanation,  the  central  point 
plays  a  special,  distinguished  role,  and  position  in  space  has  an  absolute 
meaning. 

In  Newtonian  mechanics,  on  the  other  hand,  bodies  fall  downward  be- 
cause they  are  attracted  towards  the  Earth,  rather  than  towards  some  fixed 
point  in  space.  Thus  position  has  a  meaning  only  relative  to  the  Earth, 
or  to  some  other  body.  In  just  the  same  way,  velocity  has  only  a  relative 
significance.  Given  two  bodies  moving  with  constant  relative  velocity,  it  is 
impossible  in  principle  to  decide  which  of  them  is  at  rest,  and  which  mov- 
ing. This  statement,  which  is  of  fundamental  importance,  is  the  principle 
of  relativity. 

Acceleration,  however,  still  retains  an  absolute  meaning,  since  it  is  ex- 
perimentally possible  to  distinguish  between  motion  with  uniform  velocity 
(i.e.,  constant  in  magnitude  and  direction)  and  accelerated  motion.  If  we 
are  sitting  inside  an  aircraft,  we  can  easily  detect  its  acceleration,  but  we 
cannot  measure  its  velocity  —  though  by  looking  out  we  can  estimate  its 
velocity  relative  to  objects  outside.  (In  Einstein's  theory  of  general  relativ- 
ity, even  acceleration  becomes  a  relative  concept,  at  least  on  a  small  scale. 
This  is  made  possible  by  the  fact  that,  to  an  observer  in  a  confined  region 
of  space,  the  effects  of  being  accelerated  and  of  being  in  a  gravitational  field 
are  indistinguishable.) 

If  two  unaccelerated  observers  perform  the  same  experiment,  they  must 
arrive  at  the  same  results.  It  makes  no  difference  whether  it  is  performed 
on  the  ground  or  in  a  smoothly  travelling  vehicle.  However,  an  accelerated 
observer  who  performs  the  experiment  may  well  get  a  different  answer.  The 
relativity  principle  asserts  that  all  unaccelerated  observers  are  equivalent; 
it  says  nothing  about  accelerated  observers. 

Inertial  frames 

It  is  useful  at  this  point  to  introduce  the  concept  of  a  frame  of  reference. 
To  specify  positions  and  time,  each  observer  may  choose  a  zero  of  the  time 
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scale,  an  origin  in  space,  and  a  set  of  three  Cartesian  co-ordinate  axes.  We 
shall  refer  to  these  collectively  as  a  frame  of  reference.  The  position  and 
time  of  any  event  may  then  be  specified  with  respect  to  this  frame  by  the 
three  Cartesian  co-ordinates  x,  y,  z  and  the  time  t.  If  we  are  located  on  a 
solid  body,  such  as  the  Earth,  we  may,  for  example,  choose  some  point  of 
the  body  as  the  origin,  and  take  the  axes  to  be  rigidly  fixed  to  it  (though, 
as  we  discuss  later,  this  frame  is  not  quite  unaccelerated). 

In  view  of  the  relativity  principle,  the  frames  of  reference  used  by  differ- 
ent unaccelerated  observers  are  completely  equivalent.  The  laws  of  physics 
expressed  in  terms  of  our  x,y,z,t  must  be  identical  with  those  of  someone 
else's  x',  y',  z',  t'.  They  are  not,  however,  identical  with  the  laws  expressed 
in  terms  of  the  co-ordinates  used  by  an  accelerated  observer.  The  frames 
used  by  unaccelerated  observers  are  called  inertial  frames. 

We  have  not  yet  said  how  we  can  tell  whether  a  given  observer  is  un- 
accelerated. We  need  a  criterion  to  distinguish  inertial  frames  from  the 
others.  Formally,  an  inertial  frame  may  be  defined  to  be  one  with  respect 
to  which  any  isolated  body,  far  removed  from  all  other  matter,  would  move 
with  uniform  velocity.  This  is  of  course  an  idealized  definition,  since  in 
practice  we  can  never  get  infinitely  far  away  from  other  matter.  For  all 
practical  purposes,  however,  an  inertial  frame  is  one  whose  orientation  is 
fixed  relative  to  the  'fixed'  stars,  and  in  which  the  Sun  (or  more  precisely 
the  centre  of  mass  of  the  solar  system)  moves  with  uniform  velocity.  It  is  an 
essential  assumption  of  classical  mechanics  that  such  frames  exist.  Indeed, 
this  assumption  (together  with  a  definition  of  inertial  frames)  is  the  real 
physical  content  of  Newton's  first  law  (a  body  acted  on  by  no  forces  moves 
with  uniform  velocity  in  a  straight  line) . 

It  is  generally  convenient  to  use  only  inertial  frames,  but  there  is  no 
necessity  to  do  so.  Sometimes  it  proves  convenient  to  use  a  non-inertial  (in 
particular,  rotating)  frame,  in  which  the  laws  of  mechanics  take  on  a  more 
complicated  form.  For  example,  we  shall  discuss  in  Chapter  5  the  use  a 
frame  rigidly  fixed  to  the  rotating  Earth. 

Vectors 

It  is  often  convenient  to  use  a  notation  which  does  not  refer  explicitly  to  a 
particular  set  of  co-ordinate  axes.  Instead  of  using  Cartesian  co-ordinates 
x,  y,  z  we  may  specify  the  position  of  a  point  P  with  respect  to  a  given 
origin  O  by  the  length  and  direction  of  the  line  OP.  A  quantity  which  is 
specified  by  a  magnitude  and  a  direction  is  called  a  vector,  in  this  case  the 
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position  vector  r  of  P  with  respect  to  O.  Many  other  physical  quantities  are 
also  vectors:  examples  are  velocity  and  force.  They  are  to  be  distinguished 
from  scalars  —  like  mass  and  energy  —  which  are  completely  specified  by 
a  magnitude  alone. 

We  assume  here  that  readers  are  familiar  with  the  ideas  of  vector  alge- 
bra; if  not,  they  will  find  a  discussion  which  includes  all  the  results  we  shall 
need  in  Appendix  A. 

Throughout  this  book,  vectors  will  be  denoted  by  boldface  letters  (like 
a).  The  magnitude  of  the  vector  will  be  denoted  by  the  corresponding 
letter  in  ordinary  italic  type  (a),  or  by  the  use  of  vertical  bars  (\a\).  The 
scalar  and  vector  products  of  two  vectors  a  and  b  will  be  written  a  ■  b  and 
a  A  6  respectively.  We  shall  use  r  to  denote  the  unit  vector  in  the  direction 
of  r,  r  =  r/r.  The  unit  vectors  along  the  x-,  y-,  z-axes  will  be  denoted  by 
i,  j,  k,  so  that 

r  =  xi  +  yj  +  zk. 

We  shall  use  the  vector  notation  in  formulating  the  basic  laws  of  mechan- 
ics, both  because  of  the  mathematical  simplicity  thereby  attained,  and  be- 
cause the  physical  ideas  behind  the  mathematical  formalism  are  often  much 
clearer  in  terms  of  vectors. 

1.2    Newton's  Laws 

Classical  mechanics  describes  how  physical  objects  move,  how  their  posi- 
tions change  with  time.  Its  basic  laws  may  be  applied  to  objects  of  any  size 
(above  the  atomic  level)  and  of  any  shape  and  internal  structure,  and,  in 
classical  hydrodynamics,  to  fluids  too.  It  is  not  immediately  obvious,  how- 
ever, what  is  meant  by  the  'position'  of  a  large  object  of  complex  shape. 
Only  in  the  idealized  case  of  point  particles  (which  do  not  exist  in  nature) 
does  this  concept  have  an  intuitively  obvious  meaning.  We  shall  therefore 
consider  first  only  small  bodies  which  can  be  effectively  located  at  a  point, 
so  that  the  position  of  each,  at  time  t,  can  be  specified  by  a  position  vector 
r(t). 

When  we  come  to  deal  with  large  extended  bodies,  in  Chapter  8,  we 
shall  make  the  additional  assumption  that  any  such  body  may  be  divided 
up  into  a  large  number  of  very  small  bodies,  each  of  which  may  be  treated 
as  a  point  particle.  (We  shall  also  need  to  make  some  assumptions  about 
the  nature  of  the  internal  forces  between  these  particles.)  We  shall  then  find 
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that  if  we  are  interested  in  the  overall  motion  of  even  a  very  large  object, 
such  as  a  planet,  we  may  often  legitimately  treat  it  as  a  point  particle 
located  at  the  centre  of  mass  of  the  body.  The  laws  themselves  prescribe 
the  meaning  of  the  'position'  of  an  extended  body. 

We  shall  begin  by  simply  stating  Newton's  laws,  and  defer  to  the  follow- 
ing section  a  discussion  of  the  physical  significance  of  the  concepts  involved, 
particularly  those  of  mass  and  force. 

Let  us  consider  an  isolated  system  comprising  N  bodies,  which  we  label 
by  an  index  i  =  1,2, . . . ,  N .  By  saying  that  the  system  is  isolated,  we  mean 
that  all  other  bodies  are  sufficiently  remote  to  have  a  negligible  influence 
on  it.  Each  of  the  N  bodies  is  assumed  to  be  small  enough  to  be  treated 
as  a  point  particle.  The  position  of  the  zth  body  with  respect  to  a  given 
inertial  frame  will  be  denoted  by  Ti{i).  Its  velocity  and  acceleration  are 

Vi(t)  =  rt{t), 

a,i(t)  =  Vi{t)  =  r-i{t), 

where  the  dots  denote  differentiation  with  respect  to  the  time  t.  For  exam- 
ple 

dr 

r  =  ~r- 

dt 

Each  body  is  characterized  by  a  scalar  constant,  its  mass  mj.  Its  momentum 
p{  is  defined  to  be  mass  x  velocity: 

Pi  =  rriiVi. 

The  equation  of  motion,  which  specifies  how  the  body  will  move  is  Newton's 
second  law  (mass  x  acceleration  =  force): 

pi=miai  =  Fu  (1.1) 

where  Fi  is  the  total  force  acting  on  the  body.  This  force  is  composed  of 
a  sum  of  forces  due  to  each  of  the  other  bodies  in  the  system.  If  we  denote 
the  force  on  the  ith  body  due  to  the  jth  body  by  Fjj ,  then 

TV 

Fi  =  Fil+Fi2  +  ---+FiN  =  ^Ftj,  (1.2) 

where  of  course  Fa  =  0,  since  there  is  no  force  on  the  ith  body  due  to 
itself.  Note  that  since  the  sum  on  the  right  side  of  (1.2)  is  a  vector  sum, 
this  equation  incorporates  the  'parallelogram  law'  of  composition  of  forces. 
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The  two-body  forces  F,j  must  satisfy  Newton's  third  law,  which  asserts 
that  'action'  and  'reaction'  are  equal  and  opposite, 

F  ji  =  -F  ij.  (1.3) 

Moreover,  Fij  is  a  function  of  the  positions  and  velocities  (and  internal 
structure)  of  the  ith  and  jth  bodies,  but  is  unaffected  by  the  presence  of 
the  other  bodies.  (It  can  be  argued  that  this  is  an  unnecessarily  restrictive 
assumption.  It  would  be  perfectly  possible  to  include  also,  say,  three-body 
forces,  which  depend  on  the  positions  and  velocities  of  three  particles  si- 
multaneously. However,  within  the  realm  of  validity  of  classical  mechanics, 
no  such  forces  are  known,  and  their  inclusion  would  be  an  inessential  com- 
plication.) Because  of  the  relativity  principle,  the  force  can  in  fact  depend 
only  on  the  relative  position 

(see  Fig.  1.1),  and  the  relative  velocity 

Vtj  =Vi~  Vj. 


Fig.  1.1 

If  the  forces  are  known,  as  functions  of  the  positions  and  velocities, 
then  from  (1.1)  we  can  predict  the  future  motion  of  the  bodies.  Given  their 
initial  positions  and  velocities,  we  can  solve  these  equations  (analytically 
or  numerically)  to  find  their  positions  at  a  later  time. 

There  is  here  an  implicit  assumption  of  perfect  knowledge  and  infinite 
precision  of  calculation.  It  is  now  recognized  (see  Chapters  13,  14)  that  this 
assumption  is,  in  general,  false,  leading  to  a  loss  of  predictability.  However, 
for  the  time  being,  we  shall  assume  that  our  solution  can  be  effected. 
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All  that  then  remains  is  to  specify  the  precise  laws  by  which  the  two- 
body  forces  are  to  be  determined.  The  most  important  class  of  forces  con- 
sists of  the  central,  conservative  forces,  which  depend  only  on  the  relative 
positions  of  the  two  bodies,  and  have  the  form 

FH  =rijf(rij),  (1.4) 

where,  as  usual,  fy  is  the  unit  vector  in  the  direction  of  r%j  and  f(rij)  is 
a  scalar  function  of  the  relative  distance  r^.  If  /(r^)  is  positive  the  force 
Fij  is  a  repulsive  force  directed  outwards  along  the  line  joining  the  bodies; 
if  f(rij)  is  negative,  it  is  an  attractive  force,  directed  inwards. 

According  to  Newton's  law  of  universal  gravitation,  there  is  a  force  of 
this  type  between  every  pair  of  bodies,  proportional  in  magnitude  to  the 
product  of  their  masses.  It  is  given  by  (1.4)  with 

/(r«)  =  -^,  (1-5) 

ij 

where  G  is  Newton's  gravitational  constant,  whose  value  is 

G  =  6.673  x  10-nNm2kg"2. 

Since  the  masses  are  always  positive,  this  force  is  always  attractive. 

In  addition,  if  the  bodies  are  electrically  charged,  there  is  an  electrostatic 
Coulomb  force  given  by 


QiQj_ 

where  %  and  qj  are  the  electric  charges,  and  eo  is  another  constant, 

e0  =  8.85419  x  10_12Fm_1. 

Note  that  the  analogue  of  Newton's  constant  G  is 

l/47reo  =  8.987  55  x  109Nm2C"2. 

Electric  charges  may  be  of  either  sign,  and  therefore  the  electrostatic  force 
may  be  repulsive  or  attractive  according  to  the  relative  sign  of  q\  and  qj . 

Note  the  enormous  difference  in  the  orders  of  magnitude  of  the  constants 
G  and  1/4iycq  when  expressed  in  SI  units.  This  serves  to  illustrate  the 
fact  that  gravitational  forces  are  really  exceptionally  weak.  They  appear 
significant  to  us  only  because  we  happen  to  live  close  to  a  body  of  very 
large  mass.  Correspondingly  large  charges  never  appear,  because  positive 
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and  negative  charges  largely  cancel  out,  leaving  macroscopic  bodies  with  a 
net  charge  close  to  zero. 

In  bodies  with  structure,  central,  conservative  forces  between  their  con- 
stituent parts  can  evidently  give  rise  to  forces  which  are  still  conservative 
(i.e.,  which  are  independent  of  velocity,  and  satisfy  some  further  conditions 
that  need  not  worry  us  here  —  see  §3.1  and  §A.6),  but  no  longer  central 
(i.e.,  not  directed  along  the  line  joining  the  bodies).  This  can  happen,  for 
example,  if  there  is  a  distribution  of  electric  charge  within  each  body. 

They  can  also  give  rise,  in  a  less  obvious  way,  to  non-conservative, 
velocity-dependent  forces,  as  we  shall  see  in  Chapter  2.  Many  resistive  and 
frictional  forces  can  be  understood  as  macroscopic  effects  of  forces  which 
are  really  conservative  on  a  small  scale.  The  chief  distinguishing  feature  of 
conservative  forces  is  the  existence  of  a  quantity  which  is  conserved,  i.e., 
whose  total  value  never  changes,  namely  the  energy  of  the  system.  Fric- 
tional forces  have  the  effect  of  transferring  some  of  this  energy  from  the 
large-scale  motion  of  the  bodies  to  small-scale  movements  in  their  interior, 
and  therefore  appear  non-conservative  on  a  large  scale. 

In  a  sense,  therefore,  we  may  regard  central,  conservative  forces  as  the 
norm.  It  would  be  wrong  to  conclude,  however,  that  we  can  explain  every- 
thing in  terms  of  them.  In  the  first  place,  the  concepts  of  classical  mechanics 
cannot  be  applied  to  the  really  small-scale  structure  of  matter.  For  that, 
we  need  quantum  mechanics.  More  serious  is  the  existence  of  electromag- 
netic forces,  which  are  of  great  importance  even  in  the  realm  of  classical 
physics,  but  which  cannot  readily  be  accommodated  in  the  framework  of 
classical  mechanics.  The  force  between  two  charges  in  relative  motion  is 
neither  central  nor  conservative,  and  does  not  even  satisfy  Newton's  third 
law  (1.3).  This  is  a  consequence  of  the  finite  velocity  of  propagation  of 
electromagnetic  waves.  The  force  on  one  charge  depends  not  only  on  the 
instantaneous  position  of  the  other,  but  on  its  past  history.  The  effect  of 
a  disturbance  of  one  charge  is  not  felt  immediately  by  the  other,  but  after 
an  interval  of  time  sufficient  for  a  light  signal  to  propagate  from  one  to  the 
other.  This  particular  difficulty  may  be  resolved  by  introducing  the  concept 
of  the  electromagnetic  field.  Then  we  may  suppose  that  one  charge  does 
not  act  directly  on  the  other,  but  on  the  field  in  its  immediate  vicinity;  this 
in  turn  affects  the  field  further  out,  and  so  on.  By  supposing  that  the  field 
itself  can  carry  energy  and  momentum,  we  can  reinstate  the  conservation 
laws,  which  are  among  the  most  important  consequences  of  Newton's  laws. 

However,  this  does  not  completely  remove  the  difficulty,  for  there  is 
still  an  apparent  contradiction  between  this  classical  electromagnetic  theory 
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and  the  principle  of  relativity  discussed  in  §1.1.  This  arises  from  the  fact 
that  if  the  speed  of  light  is  constant  with  respect  to  one  inertial  frame 
—  as  it  should  be  according  to  electromagnetic  theory  —  then  the  usual 
rules  for  combining  velocities  would  lead  to  the  conclusion  that  it  is  not 
constant  with  respect  to  a  relatively  moving  frame,  in  contradiction  with  the 
statement  that  all  inertial  frames  are  equivalent.  This  paradox  can  only  be 
resolved  by  the  introduction  of  Einstein's  theory  of  relativity  (i.e.,  'special' 
relativity) .  Classical  electromagnetic  theory  and  classical  mechanics  can  be 
incorporated  into  a  single  self-consistent  theory,  but  only  by  ignoring  the 
relativity  principle  and  sticking  to  one  'preferred'  inertial  frame. 

1.3    The  Concepts  of  Mass  and  Force 

It  is  an  important  general  principle  of  physics  (though  not  universally  ap- 
plied!) that  no  quantity  should  be  introduced  into  the  theory  which  cannot, 
at  least  in  principle,  be  measured.  Now,  Newton's  laws  involve  not  only 
the  concepts  of  velocity  and  acceleration,  which  can  be  measured  by  mea- 
suring distances  and  times,  but  also  the  new  concepts  of  mass  and  force. 
To  give  the  laws  a  physical  meaning  we  have,  therefore,  to  show  that  these 
are  measurable  quantities.  This  is  not  quite  as  trivial  as  it  might  seem, 
because  any  experiment  designed  to  measure  these  quantities  must  nec- 
essarily involve  Newton's  laws  themselves  in  its  interpretation.  Thus  the 
operational  definitions  of  mass  and  force  —  the  prescriptions  of  how  they 
may  be  measured  —  which  are  required  to  make  the  laws  physically  signifi- 
cant, are  actually  contained  in  the  laws  themselves.  This  is  by  no  means  an 
unusual  or  logically  objectionable  situation,  but  it  may  clarify  the  status 
of  these  concepts  to  reformulate  the  laws  in  such  a  way  as  to  isolate  their 
definitional  element. 

Let  us  consider  first  the  measurement  of  mass.  Since  the  units  of  mass 
are  arbitrary,  we  have  to  specify  a  way  of  comparing  the  masses  of  two  given 
bodies.  It  is  important  to  realize  that  we  are  discussing  here  the  inertial 
mass,  which  appears  in  Newton's  second  law,  (1.1)  and  not  the  gravitational 
mass,  which  appears  in  (1.5).  The  two  are  of  course  proportional,  but  this 
equivalence  principle  is  a  physical  law  derived  from  experimental  observa- 
tion (in  particular  from  Galileo's  observations  of  falling  bodies,  from  which 
he  deduced  that  in  a  vacuum  all  bodies  would  fall  equally  fast)  rather  than 
an  a  priori  assumption.  To  verify  the  law,  we  must  be  able  to  measure  each 
kind  of  mass  separately.  This  rules  out,  for  example,  the  use  of  a  balance, 
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which  compares  gravitational  masses. 

Clearly,  we  can  compare  the  inertial  masses  of  two  bodies  by  subjecting 
them  to  equal  forces  and  comparing  their  accelerations,  but  this  does  not 
help  unless  we  have  some  way  of  knowing  that  the  forces  are  equal.  However 
there  is  one  case  in  which  we  do  know  this,  because  of  Newton's  third  law.  If 
we  isolate  the  two  bodies  from  all  other  matter,  and  compare  their  mutually 
induced  accelerations,  then  according  to  (1.1)  and  (1.3), 


so  that  the  accelerations  are  oppositely  directed,  and  inversely  proportional 
to  the  masses.  If  we  allow  two  small  bodies  to  collide,  then  during  the  colli- 
sion the  effects  of  more  remote  bodies  are  generally  negligible  in  comparison 
with  their  effect  on  each  other,  and  we  may  treat  them  approximately  as 
an  isolated  system.  (Such  collisions  will  be  discussed  in  detail  in  Chapters 
2  and  7.)  The  mass  ratio  can  then  be  determined  from  measurements  of 
their  velocities  before  and  after  the  collision,  by  using  (1.7)  or  its  immediate 
consequence,  the  law  of  conservation  of  momentum, 


If  we  wish  to  separate  the  definition  of  mass  from  the  physical  content  of 
equation  (1.7),  we  may  adopt  as  a  fundamental  axiom  the  following: 

In  an  isolated  two-body  system,  the  accelerations  always 
satisfy  the  relation  ai  =  —  A;2ia2,  where  the  scalar  &2i 
is,  for  two  given  bodies,  a  constant  independent  of  their 
positions,  velocities  and  internal  states. 

If  we  choose  the  first  body  to  be  a  standard  body,  and  conventionally  assign 
it  unit  mass  (say  mi  =  1  kg),  then  we  may  define  the  mass  of  the  second 
to  be  &2i  in  units  of  this  standard  mass  (here  mi  =  kn\  kg). 

Note  that  for  consistency,  we  must  have  fci2  =  l/foi.  We  must  also 
assume  of  course  that  if  we  compare  the  masses  of  three  bodies  in  this  way, 
we  obtain  consistent  results: 

For  any  three  bodies,  the  constants      satisfy  £31  =  k^iku  ■ 

It  then  follows  that  for  any  two  bodies,  k$2  is  the  mass  ratio:       =  mz/m,2- 
To  complete  the  list  of  fundamental  axioms,  we  need  one  which  deals 
with  systems  containing  more  than  two  bodies,  analogous  to  the  law  of 
composition  of  forces,  (1.2).  This  may  be  stated  as  follows: 


miOl  =  —771202, 


(1.7) 


m\V\  +  ni2V2  =  constant. 


(1.8) 
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The  acceleration  induced  in  one  body  by  another  is  some 
definite  function  of  their  positions,  velocities  and  internal 
structure,  and  is  unaffected  by  the  presence  of  other  bodies. 
In  a  many-body  system,  the  acceleration  of  any  given  body 
is  equal  to  the  sum  of  the  accelerations  induced  in  it  by 
each  of  the  other  bodies  individually. 

These  laws,  which  appear  in  a  rather  unfamiliar  form,  are  actually  com- 
pletely equivalent  to  Newton's  laws,  as  stated  in  the  previous  section.  In 
view  of  the  apparently  fundamental  role  played  by  the  concept  of  force  in 
Newtonian  mechanics,  it  is  remarkable  that  we  have  been  able  to  reformu- 
late the  basic  laws  without  mentioning  this  concept.  It  can  of  course  be 
introduced,  by  defining  it  through  Newton's  second  law,  (1.1).  The  util- 
ity of  this  definition  arises  from  the  fact  that  forces  satisfy  Newton's  third 
law,  (1.3),  while  accelerations  satisfy  only  the  more  complicated  law,  (1.7). 
Since  the  mutually  induced  accelerations  of  two  given  bodies  are  always 
proportional,  they  are  essentially  determined  by  a  single  function,  and  it 
is  useful  to  introduce  the  more  symmetric  concept  of  force,  for  which  this 
becomes  obvious. 

It  is  interesting  to  note,  finally,  that  one  consequence  of  our  basic  laws 
is  the  additive  nature  of  mass.  Let  us  take  a  three-body  system.  Then, 
returning  to  the  notation  of  the  previous  section,  the  equations  of  motion 
for  the  three  bodies  are 


m3a3  =  F31  +  F32. 

If  we  add  these  equations,  then,  in  view  of  (1.3),  the  terms  on  the  right 
cancel  in  pairs,  and  we  are  left  with 


which  is  the  generalization  of  (1.7).  Now,  if  we  suppose  that  the  force 
between  the  second  and  third  is  such  that  they  are  rigidly  bound  together 
to  form  a  composite  body,  their  accelerations  must  be  equal:  a2  =  a3.  In 
that  case,  we  get 


miai  =  F12  +  F13, 
m2a2  =  F2i  +  F23, 


(1.9) 


m\a\  +  m2a2  +  m3a3  =  0, 


(1.10) 


roioi  =  —{m2  +  m3)a2, 


which  shows  that  the  mass  of  the  composite  body  is  just  77123  =  m2  +  rn-i- 
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1.4    External  Forces 

To  find  the  motion  of  the  various  bodies  in  any  dynamical  system,  we  have 
to  solve  two  closely  interrelated  problems.  First,  given  the  positions  and 
velocities  at  any  one  instant  of  time,  we  have  to  determine  the  forces  acting 
on  each  body.  Second,  given  the  forces  acting,  we  have  to  compute  the 
new  positions  and  velocities  after  a  short  interval  of  time  has  elapsed.  In 
a  general  case,  these  two  problems  are  inextricably  bound  up  with  each 
other,  and  must  be  solved  simultaneously.  If,  however,  we  are  concerned 
with  the  motions  of  a  small  body,  or  group  of  small  bodies,  then  we  can 
often  neglect  its  effect  on  other  bodies,  and  in  that  case  the  two  problems 
can  be  separated. 

For  example,  in  discussing  the  motion  of  an  artificial  satellite,  we  can 
clearly  ignore  its  effect  on  the  Earth.  Since  the  motion  of  the  Earth  is 
already  known,  we  can  calculate  the  force  on  the  satellite  as  a  function  of 
its  position  and  (if  atmospheric  resistance  is  included)  its  velocity.  Then, 
taking  the  force  as  known,  we  can  solve  separately  the  problem  of  its  motion. 
In  the  latter  problem,  we  are  really  concerned  with  the  satellite  alone.  The 
Earth  enters  simply  as  a  known  external  influence. 

In  many  cases,  therefore,  it  is  useful  to  concentrate  our  attention  on  a 
small  part  of  a  dynamical  system,  and  to  represent  the  effect  of  everything 
outside  this  by  external  forces,  which  we  suppose  to  be  known  in  advance, 
as  functions  of  position,  velocity  and  time.  This  is  the  kind  of  problem  with 
which  we  shall  be  mainly  concerned  in  the  next  few  chapters.  Typically, 
we  shall  consider  the  motion  of  a  particle  under  a  known  external  force. 
In  Chapter  6,  we  consider,  for  the  gravitational  and  electrostatic  cases, 
the  complementary  problem  of  determining  the  force  from  a  knowledge  of 
the  positions  of  other  bodies.  Later,  in  Chapter  7,  we  return  to  the  more 
complex  type  of  problem  in  which  the  system  of  immediate  interest  cannot 
be  taken  to  be  merely  a  single  particle. 


1.5  Summary 

To  some  extent,  the  selection  of  a  group  of  basic  concepts,  in  terms  of  which 
others  are  to  be  defined,  is  a  matter  of  choice.  We  have  chosen  to  regard 
position  and  time  (relative  to  some  frame  of  reference)  as  basic.  From  this 
point  of  view,  Newton's  laws  must  be  regarded  as  containing  definitions  in 
addition  to  physical  laws.  The  first  law  contains  the  definition  of  an  inertial 
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frame,  together  with  the  physical  assertion  that  such  frames  exist,  while  the 
second  and  third  laws  contain  definitions  of  mass  and  force.  These  laws, 
supplemented  by  the  laws  of  force,  such  as  the  law  of  universal  gravitation, 
provide  the  equations  from  which  we  can  determine  the  motion  of  any 
dynamical  system. 


Problems 

Note.  Here  and  in  later  chapters,  starred  problems  are  somewhat  harder. 

1.  An  object  A  moving  with  velocity  v  collides  with  a  stationary  object  B. 
After  the  collision,  A  is  moving  with  velocity  \v  and  B  with  velocity 
|w.  Find  the  ratio  of  their  masses.  If,  instead  of  bouncing  apart,  the 
two  bodies  stuck  together  after  the  collision,  with  what  velocity  would 
they  then  move? 

2.  The  two  components  of  a  double  star  are  observed  to  move  in  circles  of 
radii  r\  and  r-i.  What  is  the  ratio  of  their  masses?  (Hint:  Write  down 
their  accelerations  in  terms  of  the  angular  velocity  of  rotation,  ui.) 

3.  Consider  a  system  of  three  particles,  each  of  mass  m,  whose  motion  is 
described  by  (1.9).  If  particles  2  and  3,  even  though  not  rigidly  bound 
together,  are  regarded  as  forming  a  composite  body  of  mass  2m  located 
at  the  mid-point  r  =  \(r2+rz),  find  the  equations  describing  the  motion 
of  the  two-body  system  comprising  particle  1  and  the  composite  body 
(2  +  3).  What  is  the  force  on  the  composite  body  due  to  particle  1?  Show 
that  the  equations  agree  with  (1.7).  When  the  masses  are  unequal,  what 
is  the  correct  definition  of  the  position  of  the  composite  (2  +  3)  that  will 
make  (1.7)  still  hold? 

4.  Find  the  distance  r  between  two  protons  at  which  the  electrostatic 
repulsion  between  them  will  equal  the  gravitational  attraction  of  the 
Earth  on  one  of  them.  (Proton  charge  =  1.6  x  10-19  C,  proton  mass  = 
1.7  x  10"27kg.) 

5.  Consider  a  transformation  to  a  relatively  uniformly  moving  frame  of 
reference,  where  each  position  vector  1%  is  replaced  by  =  r*  —  vt. 
(Here  v  is  a  constant,  the  relative  velocity  of  the  two  frames.)  How  does 
a  relative  position  vector  r,j  transform?  How  do  momenta  and  forces 
transform?  Show  explicitly  that  if  equations  (1.1)  to  (1.4)  hold  in  the 
original  frame,  then  they  also  hold  in  the  new  one. 

6.  A  body  of  mass  50  kg  is  suspended  by  two  light,  inextensible  cables  of 
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lengths  15  m  and  20  m  from  rigid  supports  placed  25  m  apart  on  the  same 
level.  Find  the  tensions  in  the  cables.  (Note  that  by  convention  'light' 
means  'of  negligible  mass'.  Take  g  =  10ms-2.  This  and  the  following 
two  problems  are  applications  of  vector  addition.) 

7.  *An  aircraft  is  to  fly  to  a  destination  800  km  due  north  of  its  starting 
point.  Its  airspeed  is  800 kmh  .  The  wind  is  from  the  east  at  a  speed 
of  30ms_1.  On  what  compass  heading  should  the  pilot  fly?  How  long 
will  the  flight  take?  If  the  wind  speed  increases  to  50ms_1,  and  the 
wind  backs  to  the  north-east,  but  no  allowance  is  made  for  this  change, 
how  far  from  its  destination  will  the  aircraft  be  at  its  expected  arrival 
time,  and  in  what  direction? 

8.  *The  two  front  legs  of  a  tripod  are  each  1.4  m  long,  with  feet  0.8  m  apart. 
The  third  leg  is  1.5  m  long,  and  its  foot  is  1.5  m  directly  behind  the  mid- 
point of  the  line  joining  the  other  two.  Find  the  height  of  the  tripod,  and 
the  vectors  representing  the  positions  of  its  three  feet  relative  to  the  top. 
(Hint:  Choose  a  convenient  origin  and  axes  and  write  down  the  lengths 
of  the  legs  in  terms  of  the  position  vector  of  the  top.)  Given  that  the 
tripod  carries  a  weight  of  mass  2  kg,  find  the  forces  in  the  legs,  assuming 
they  are  purely  compressional  (i.e.,  along  the  direction  of  the  leg)  and 
that  the  legs  themselves  have  negligible  weight.  (Take  g  =  10ms"2.) 

9.  *Discuss  the  possibility  of  using  force  rather  than  mass  as  the  basic 
quantity,  taking  for  example  a  standard  weight  (at  a  given  latitude)  as 
the  unit  of  force.  How  should  one  then  define  and  measure  the  mass  of 
a  body? 

10.  The  first  estimate  of  Newton's  constant  was  made  by  the  astronomer 
Nevil  Maskelyne  in  1774  by  measuring  the  angle  between  the  direc- 
tions of  the  apparent  plumb-line  vertical  on  opposite  sides  of  the  Scot- 
tish mountain  Schiehallion  (height  1081  m,  chosen  for  its  regular  conical 
shape).  Find  a  rough  estimate  of  the  angle  through  which  a  plumb  line 
is  deviated  by  the  gravitational  attraction  of  the  mountain,  by  modelling 
the  mountain  as  a  sphere  of  radius  500m  and  density  2.7  x  103  kgm-3, 
and  assuming  that  its  gravitational  effect  is  the  same  as  though  the  total 
mass  were  concentrated  at  its  centre.  (This  latter  assumption  will  be 
justified,  for  a  spherical  object,  in  Chapter  6.) 
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Chapter  2 

Linear  Motion 


In  this  chapter  we  discuss  the  motion  of  a  body  which  is  free  to  move 
only  in  one  dimension.  The  problems  discussed  are  chosen  to  illustrate  the 
concepts  and  techniques  which  will  be  of  use  in  the  more  general  case  of 
three-dimensional  motion. 

2.1     Conservative  Forces;  Conservation  of  Energy 

We  consider  first  a  particle  moving  along  a  line,  under  a  force  which  is  given 
as  a  function  of  its  position,  F(x).  Then  the  equation  of  motion  (1.1)  is 


Since  this  equation  is  of  second  order  in  the  time  derivatives,  we  shall  have 
to  integrate  twice  to  find  X  cLS  £L  function  of  t.  Thus  the  solution  will  contain 
two  arbitrary  constants  that  may  be  fixed  by  specifying  the  initial  values 
of  x  and  x. 

When  the  force  depends  only  on  x,  we  can  always  find  a  'first  integral', 
a  function  of  x  and  x  whose  value  is  constant  in  time.  Let  us  consider  the 
kinetic  energy, 


mx  =  F(x). 


(2.1) 


mar 


4,2 


(2.2) 


Differentiating,  we  find  for  its  rate  of  change 


T  =  mix  =  F(x)x, 


(2.3) 


using  (2.1).  Integrating  with  respect  to  time,  we  find 


(2.4) 
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If  we  define  the  potential  energy, 


V{x)  =  -  F(x')dx', 


(2.5) 


Jxa 


we  can  write  (2.4)  in  the  form 


T  +  V  =  E  =  constant. 


(2.6) 


Here  xo  is  an  arbitrary  constant,  corresponding  to  the  arbitrary  additive 
constant  of  integration  in  (2.4).  There  is  a  corresponding  arbitrariness  up 
to  an  additive  constant  in  the  total  energy  E. 

Note  that  (2.5)  can  be  inverted  to  give  the  force  in  terms  of  the  potential 
energy: 


The  equation  (2.6)  is  the  law  of  conservation  of  energy.  A  force  of  this 
type,  depending  only  on  x,  is  called  a  conservative  force.  A  great  deal  of 
information  about  the  motion  can  be  obtained  from  this  conservation  law, 
even  without  integrating  again  to  find  x  explicitly  as  a  function  of  t.  If 
the  initial  position  and  velocity  are  given,  we  can  calculate  the  value  of  the 
constant  E.  Then  (2.6)  in  the  form 


gives  the  velocity  of  the  particle  (except  for  sign)  when  it  is  at  any  given 
position  x.  Since  the  kinetic  energy  is  obviously  positive,  the  motion  is 
confined  to  the  region  where 


An  example  may  help  to  explain  this. 

Example:  The  simple  pendulum 

A  simple  pendulum  comprises  a  bob  of  mass  m  supported  by  a 
light  rigid  rod  of  length  I.  (We  choose  a  rod  rather  than  a  string  so 
that  we  may  consider  motion  above  the  point  of  support.  By  'light' 
we  mean  'of  negligible  mass'.)  The  bob  starts  with  velocity  v  from 
the  equilibrium  position.  What  kinds  of  motion  are  possible  for 
different  values  of  v? 


F(x) 


(2.7) 


\rrix1  =E-  V(x) 


(2.8) 


V(x)  <  E. 
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The  distance  moved  by  the  bob  is  x  =  19,  where  9  is  the  angular  dis- 
placement from  the  downward  vertical.  Corresponding  to  the  restoring 
force  F  =  —mgsm9  =  —mgsm(x/l),  the  potential  energy  function  is 

V  =  mgl[l  -  cos(x/l)]  =  mgl{\  -  cos0).  (2.9) 

It  is  plotted  in  Fig.  2.1.  (Note  that  the  points  9  =  -n  and  9  =  —n  may 
be  identified.) 


V ' 

 »  

E(2) 

\-tnv 

\  £(D 

V — f — 

— i — y 

-%  -e0 

e0  7i  e 

Fig.  2.1 


Initially  9  =  0,  and  the  bob  is  moving  with  velocity  v.  Since  we  have 
chosen  the  arbitrary  constant  in  V  so  that  V(0)  =  0,  the  total  energy  is 
E  =  \mv2.  Now  if  E  <  2mgl  (E^  in  the  figure),  the  motion  will  be 
confined  between  the  two  angles  ±9q  where  ^(6*0)  =  E,  i.e. 

1  —  cos#o  =  v2/2gl. 

These  are  the  points  where  the  kinetic  energy  vanishes,  so  that  the  pen- 
dulum bob  is  instantaneously  at  rest.  The  motion  is  an  oscillation  of 
amplitude  90. 

On  the  other  hand,  if  the  bob  is  pushed  so  hard  that  E  >  2mgl  (-E(2) 
in  the  figure) ,  then  the  kinetic  energy  will  never  vanish.  When  the  bob 
reaches  the  upward  vertical,  it  still  has  positive  energy,  namely 

imo'2  =  E  —  2mgl  =  ^mv2  —  2mgl. 

In  this  case,  the  motion  is  a  continuous  rotation  rather  than  an  oscilla- 
tion. 
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2.2    Motion  near  Equilibrium;  the  Harmonic  Oscillator 

A  particle  can  be  in  equilibrium  only  if  the  total  force  acting  on  it  is  zero. 
For  a  conservative  force  this  means,  by  (2.7),  that  the  potential  energy  curve 
is  horizontal  at  the  position  of  the  particle.  Let  us  now  consider  the  motion 
of  a  particle  near  a  position  of  equilibrium.  Without  loss  of  generality,  we 
may  choose  the  equilibrium  point  to  be  the  origin,  x  =  0,  and  choose  the 
arbitrary  constant  in  V  so  that  V(0)  =  0.  If  we  are  interested  only  in  small 
displacements,  we  may  expand  V(x)  in  a  Maclaurin-Taylor  series, 

V{x)  =  V(Q)  +  xV'(0)  +  \x2V"(Q)  +  ■■■  , 

where  the  primes  denote  differentiation  with  respect  to  x.  Since  we  have 
chosen  1^(0)  =  0,  the  constant  term  is  absent,  and  since  the  equilibrium 
condition  is  V'(0)  =  0,  the  linear  term  is  absent  also.  Thus  near  x  =  0  we 
can  write,  approximately, 

V(x)  =  \kx2,        k  =  V"(0).  (2.10) 

(We  assume  for  simplicity  that  V"(0)  does  not  also  vanish.) 

Because  motion  near  almost  any  point  of  equilibrium  is  described  ap- 
proximately by  this  potential  energy  function,  it  is  remarkably  ubiquitous; 
it  plays  as  important  a  role  in  quantum  mechanics  as  in  classical  mechanics. 
It  will  therefore  be  useful  to  analyze  it  in  some  detail.  The  potential  energy 
curve  here  is  a  parabola  (see  Fig.  2.2).  Thus  for  k  >  0  and  any  energy 
E  >  0  (Fig.  2.2(a)),  there  are  two  points  where  V(x)  =  E,  namely 

x  =  ±a,       a  =  yJlElk.  (2.11) 

The  motion  is  an  oscillation  between  these  two  points. 

On  the  other  hand,  if  k  <  0,  the  curve  is  an  inverted  parabola  (see 
Fig.  2.2(b)).  In  this  case,  two  kinds  of  motion  are  possible.  If  E  <  0  the 
particle  may  approach  to  some  minimum  distance,  where  it  comes  momen- 
tarily to  rest  before  reversing  direction  {E(2)  in  figure).  But  if  E  >  0  (-E(i) 
in  figure),  it  has  enough  energy  to  surmount  the  barrier  and  will  never 
come  to  rest:  it  will  continue  moving  in  the  same  direction  for  ever,  with 
decreasing  speed  as  it  approaches  x  =  0  and  then  increasing  speed. 

The  force  corresponding  to  the  potential  energy  function  (2.10)  is,  by 
(2-7), 


F(x)  =  -kx. 


(2.12) 
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Fig.  2.2 


It  is  an  attractive  or  repulsive  force,  according  as  k  >  0  or  k  <  0,  propor- 
tional to  displacement  from  the  point  of  equilibrium. 
The  equation  of  motion  may  be  written 


This  equation  is  very  easy  to  solve.  We  could  proceed  directly  from  the 
energy  conservation  equation  in  the  form  (2.8)  and  integrate  again  to  obtain 


However,  since  we  shall  encounter  a  number  of  similar  equations  later,  it  will 
be  useful  to  discuss  an  alternative  method  of  solution  that  can  be  adapted 
later  to  other  examples. 

Solution  of  the  harmonic  oscillator  equation 

Equation  (2.13)  is  a  linear  differential  equation;  that  is,  one  involving  only 
linear  terms  in  x  and  its  derivatives.  Such  equations  have  the  important 
property  that  their  solutions  satisfy  the  superposition  principle:  if  X\(t)  and 
X2{t)  are  solutions,  then  so  is  any  linear  combination 


mx  +  kx  =  0. 


(2.13) 


(2.14) 


x(t)  =  (ixx^t)  +  a2x2{t), 


(2.15) 


where  cii  and  a2  are  constants;  for,  clearly, 


mx,  +  kx  =  ai(rnxi  +  kxi)  +  0,2(171x2  +  kx2)  =  0. 
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Moreover,  if  x\  and  X2  are  linearly  independent  solutions  (that  is,  unless 
one  is  simply  a  constant  multiple  of  the  other),  then  (2.15)  is  actually 
the  general  solution.  For,  since  (2.13)  is  of  second  order,  we  could  obtain 
its  solution  by  integrating  twice,  and  the  general  solution  must  therefore 
contain  just  two  arbitrary  constants  of  integration.  So  to  find  the  general 
solution,  all  we  have  to  do  is  to  find  any  two  independent  solutions  X\{t) 
and  X2(t). 

Let  us  consider  first  the  case  where  k  <  0,  so  that  V(x)  has  a  maximum 
at  x  =  0.  Then  (2.13)  can  be  written 

x  —  p2x  =  0,       p  =  yj—k/m.  (2-16) 

It  is  easy  to  verify  that  this  equation  is  satisfied  by  the  functions  x  =  ept 
and  x  =  e~pt.  Thus  the  general  solution  is 

x  =  \Aept  +  ±Be~pt.  (2.17) 

(We  introduce  the  factor  of  |  for  later  convenience;  it  is  a  matter  of  con- 
vention whether  we  call  the  arbitrary  constants  A  and  B  or  ^A  and  \B.) 
Clearly,  a  small  displacement  will  in  general  lead  to  an  exponential  increase 
of  x  with  time,  which  continues  until  the  approximation  involved  in  (2.10) 
ceases  to  be  valid.  Thus  the  equilibrium  is  unstable,  as  we  should  expect 
when  V  has  a  maximum. 

We  now  turn  to  the  case  where  k  >  0  and  V  has  a  minimum  at  x  =  0. 
Then  the  potential  energy  function  (2.10)  is  that  of  a  simple  harmonic 
oscillator.  The  equation  of  motion  (2.13)  becomes 

x  +  uj2x  =  0,        w  =  yjk/m.  (2.18) 

It  is  again  very  easy  to  check  that  the  functions  x  =  cos  wt  and  x  =  sin  uit 
are  solutions,  and  the  general  solution  is  therefore 

x  =  ccos  wt  +  dsmwt.  (2-19) 

The  arbitrary  constants  c  and  d  are  to  be  determined  by  the  initial  con- 
ditions. If  at  t  =  0  the  particle  is  at  xq  with  velocity  vq,  then  we  easily 
find 

c  =  xq,       d  =  vo/w. 
An  alternative  form  of  (2.19)  is 


x  =  acos(wt  —  6), 


(2.20) 
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where  the  constants  a,  9  are  related  to  c,  d  by 

c  =  acos6*,       d  =  asm8. 

The  constant  a  is  called  the  amplitude.  It  is  identical  to  the  constant  intro- 
duced in  (2.11),  and  defines  the  limits  between  which  the  particle  oscillates, 
x  =  ±a.  The  motion  is  a  periodic  oscillation,  of  period  r  given  by 


t  =  2-k/u). 


(2.21) 


(See  Fig.  2.3.)  The  frequency  f  is  the  number  of  oscillations  per  unit  time, 
namely 


/ 


t  2tt' 


(2.22) 


Fig.  2.3 


Notice  that  this  discussion  applies  to  the  motion  of  a  particle  near  a 
stable  equilibrium  point  of  any  potential  energy  function.  For  sufficiently 
small  displacements,  any  system  of  this  kind  behaves  like  a  simple  harmonic 
oscillator.  In  particular,  the  period  or  frequency  of  small  oscillations  may 
always  be  found  from  the  second  derivative  of  the  potential  energy  function 
at  the  equilibrium  position. 

Example:  Charged  particle  between  two  fixed  charges 

A  particle  of  charge  q  is  constrained  to  move  along  a  line  between 
two  other  equal  charges  q,  fixed  at  x  =  ±a.  What  is  the  period  of 
small  oscillations? 
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Here,  the  force  (in  the  region  \x\  <  a)  is 


2  2 

qz  qz 


4neo(a  +  x)2      47reo(a  —  x)2  ' 
corresponding  to  the  potential  energy  function 

y=i-f_L+.  1  >t  2a 


47reo  \a  +  2)     a  —  x J      Attcq  \ a2 


x' 


Clearly,  the  position  of  equilibrium,  where  V'  =  0,  is  x  =  0,  and  we 
easily  find 


k  =  V"(0) 


q2a    a2  +  3x2 


q 


2 


x=0       ""c0O3  ' 


7reo  (a2  —  x2)3 
Hence,  u2  =  q2 /ireoa3m,  and  r  =  (2w/q)\/ireoa^m. 

2.3    Complex  Representation 

It  is  often  very  convenient,  in  discussing  periodic  phenomena,  to  use  com- 
plex numbers.  In  particular,  this  allows  us  to  treat  both  cases,  k  >  0  and 
k  <  0,  of  the  previous  section  together.  If  we  allow  p  to  be  complex,  then 
the  solution  in  both  cases  is  given  by  (2.16)  and  (2.17).  When  k  >  0,  p 
becomes  purely  imaginary,  p  =  \u>,  and  the  solution  (2.17)  is 

x  =  \Aelut  +  \Be~iujt.  (2.23) 

Of  course,  x  must  be  a  real  number,  so  that  A  and  B  must  be  complex 
conjugates.  If  we  write 

A  =  c  —  id,        B  =  c  +  id, 

and  use  the  relation 

elujt  =  cos  u>t  +  i  sin  tut, 

we  recover  the  solution  (2.19).  Similarly,  if  we  use  the  polar  form  of  a 
complex  number, 

,4  =  ae-ie,        B  =  aew, 


we  obtain  the  solution  in  the  form  (2.20). 
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It  is  also  useful  to  note  that  if  z  =  x  +  iy  is  a  complex  solution  of  (2.13), 

mz  +  kz  =  (mx  +  kx)  +  i(my  +  ky)  =  0, 

then  its  real  and  imaginary  parts,  x  =  Re(z)  and  y  =  lm.(z),  must  sepa- 
rately each  satisfy  the  equation.  This  method  of  obtaining  solutions  will 
be  useful  later. 

If  we  plot  the  complex  solution, 

z  =  x  +  iy  =  Aeiut, 

in  the  icy-plane,  we  see  that  z  moves  in  a  circle  around  the  origin  with 
angular  velocity  ui  (see  Fig.  2.4).  For  this  reason,  w  is  usually  called  the 


Fig.  2.4 

angular  frequency.  The  constant  A  is  a  complex  amplitude,  whose  absolute 
value  (the  real  amplitude  a)  is  the  radius  of  the  circle,  and  whose  phase 
9  defines  the  initial  direction  of  the  vector  from  the  origin  to  z.  The  real 
part  of  the  solution,  x,  may  be  regarded  as  the  projection  of  this  circular 
motion  on  the  real  axis. 

2.4    The  Law  of  Conservation  of  Energy 

This  law  was  originally  derived  from  Newton's  laws  for  the  case  where  the 
force  is  a  function  only  of  x.  It  has,  however,  a  much  wider  application. 
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By  introducing  additional  forms  of  energy  (heat,  chemical,  electromagnetic 
and  so  on),  it  has  been  extended  far  beyond  the  field  of  mechanics,  to  the 
point  where  it  is  now  recognized  as  one  of  the  most  fundamental  of  all 
physical  laws.  The  existence  of  such  a  law,  and  of  the  laws  of  conservation 
of  momentum  and  angular  momentum,  is  in  fact  closely  related  to  the 
relativity  principle,  as  we  shall  see  later  in  Chapter  12. 

Many  non-conservative  (or  dissipative)  forces  may  be  regarded  as  macro- 
scopic effects  of  forces  which  are  really  conservative  on  a  small  scale.  For 
example,  when  a  particle  penetrates  a  retarding  medium,  such  as  the  at- 
mosphere, it  experiences  a  force  which  is  velocity-dependent,  and  therefore 
non-conservative.  However,  if  we  look  at  the  situation  on  a  sub-microscopic 
scale,  we  see  that  what  happens  is  that  the  particle  makes  a  series  of  col- 
lisions with  the  molecules  of  the  medium.  In  each  collision,  energy  is  con- 
served, and  some  of  the  kinetic  energy  of  the  incoming  particle  is  transferred 
to  the  molecules  with  which  it  collides.  (Such  collisions  will  be  discussed  in 
detail  in  Chapter  7.)  By  means  of  further  collisions,  this  energy  is  gradually 
distributed  among  the  surrounding  molecules.  The  net  result  is  to  retard 
the  incoming  particle,  and  to  increase  the  average  energy  of  the  molecules 
in  the  medium.  This  increased  energy  appears  macroscopically  as  heat,  and 
results  in  a  rise  in  the  temperature  of  the  medium. 

For  an  arbitrary  force  F,  the  rate  of  change  of  the  kinetic  energy  T  is 
given  by  (2.3):  T  =  Fx.  Thus  the  increase  in  kinetic  energy  in  a  time 
interval  dt,  during  which  the  particle  moves  a  distance  da;,  is 

dT  =  dW,  (2.24) 

where 

dW  =  Fdx  (2.25) 

is  called  the  work  done  by  the  force  F  in  the  infinitesimal  displacement 
dx.  (Thus  for  a  conservative  force,  the  potential  energy  V(x)  is  equal 
to  minus  the  work  done  by  the  force  in  the  displacement  from  the  fixed 
point  xq  to  x.)  The  work  is  therefore  a  measure  of  the  amount  of  energy 
converted  to  kinetic  energy  from  other  forms,  and  the  rate  P  of  doing  work 
(the  power)  is  defined  as  Fx.  In  a  real  mechanical  system,  there  is  usually 
some  loss  of  mechanical  (kinetic  or  potential)  energy  to  heat  or  other  forms. 
Correspondingly,  there  will  be  dissipative  forces  acting  on  the  system.  (Of 
course,  it  is  also  possible  to  convert  heat  to  mechanical  energy,  as  in  an 
internal  combustion  engine.) 
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2.5    The  Damped  Oscillator 


We  saw  in  §2.2  that  a  particle  near  a  position  of  stable  equilibrium  under 
a  conservative  force  may  always  be  treated  approximately  as  a  simple  har- 
monic oscillator.  If  there  is  energy  loss,  we  must  include  in  the  equation 
of  motion  a  force  depending  on  the  velocity.  So  long  as  we  are  concerned 
only  with  small  displacements  from  the  equilibrium  position,  we  may  treat 
both  x  and  x  as  small  quantities,  and  neglect  x2 ,  xx  and  x2  in  comparison. 
Thus  we  are  led  to  consider  the  damped  harmonic  oscillator  for  which 


F  =  —kx  —  Xx, 


where  A  is  another  constant.  The  equation  of  motion  now  becomes 

mx  +  Xx  +  kx  =  0.  (2.26) 

Equations  of  this  form  turn  up  in  many  branches  of  physics.  For  example, 
the  oscillations  of  an  electrical  circuit  containing  an  inductance  L,  resistance 
R  and  capacitance  C  in  series  are  described  by  the  equation 

Lq  +  Rq+^q  =  0, 

in  which  the  variable  q  represents  the  charge  on  one  plate  of  the  condenser. 

The  equation  (2.26)  may  be  solved  as  before  by  looking  for  solutions  of 
the  form  x  =  ept.  Substituting  in  (2.26),  we  obtain  for  p  the  equation 

rap2  +  Xp  +  k  =  0, 


whose  roots  are 


p=-7±J12-cu2,  (2.27) 

where 

7  =  —,    and    uo  =  d  —  .  2.28 
2m  V  m 

Note  that  u>o  is  the  frequency  of  the  undamped  oscillator,  as  in  (2.18).  The 
rate  at  which  work  is  done  by  the  force  —  Xx  is  —  Xx2.  If  A  were  negative, 
the  particle  would  therefore  be  gaining,  rather  than  losing,  energy.  So  we 
shall  assume  that  A  is  positive. 
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Large  damping 

If  A  is  so  large  that  7  >  ujq,  then  both  roots  for  p  are  real  and  negative: 


P  =  ~7±.        7±  =  7  ±  \Jl2  -wg. 

The  general  solution  is  then 

a;  =  iJ4e-7+t  +  iBe"7"*,  (2.29) 

where  ^4  and  _B  are  arbitrary  constants.  Hence  the  displacement  tends 
exponentially  to  zero.  For  large  times,  the  dominant  term  is  that  containing 
in  the  exponent  the  smaller  coefficient,  7_.  Thus  the  characteristic  time  in 
which  the  displacement  x  is  reduced  by  a  factor  1/e  is  of  order  l/7_. 

Small  damping 

Now  let  us  consider  the  case  where  A  is  small,  so  that  7  <  u>o-  Then  the 
two  roots  for  p  are  complex  conjugates, 


p  =  -j±iu,        w  =  ycuo  -  72-  (2.30) 

The  general  solution  may  be  written  in  the  alternative  forms 

x  =  iyle^'-T*  +  lBe-iwt-7t 
=  Re(,4ei"t-7t) 

=  ae-~(tcos{ujt-6),  (2.31) 

where 

A  =  ae~ie,        B  =  aeie. 

From  the  last  form  of  (2.31)  we  see  that  this  solution  represents  an  oscilla- 
tion with  exponentially  decreasing  amplitude  ae~7*,  and  angular  frequency 
ui.  (See  Fig.  2.5.)  Note  that  ui  is  always  less  than  the  frequency  ujq  of  the 
undamped  oscillator.  The  time  in  which  the  amplitude  is  reduced  by  a 
factor  1/e,  called  the  relaxation  time  of  the  oscillator,  is 

1/7  =  2m/ X. 

It  is  often  convenient  to  introduce  the  quality  factor,  or  simply  'Q\  of 
the  oscillator,  defined  as  the  dimensionless  number 
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Fig.  2.5 

If  the  damping  is  small  then  Q  is  large.  In  a  single  oscillation  period,  the 
amplitude  is  reduced  by  the  factor  e-2""7/",  or  approximately  e-71"/^.  The 
number  of  periods  in  a  relaxation  time  is  roughly  Q/n. 

Critical  damping 

The  limiting  case,  7  =  ujq,  is  the  case  of  critical  damping,  in  which  w  =  0 
and  the  two  roots  forp  coincide.  Then  the  solution  (2.29)  or  (2.31)  involves 
only  one  arbitrary  constant,  A  +  B,  and  cannot  be  the  general  solution.  We 
need  to  find  a  second  independent  solution  in  this  case.  In  fact,  it  is  easy 
to  verify  by  direct  substitution  that  the  equation  (2.26)  in  this  special  case 
is  also  satisfied  by  the  function  te~pt.  Thus  the  general  solution  is 

x  =  (a  +  bt)e-^.  (2.33) 

Critical  damping  is  often  the  ideal.  For  example,  in  a  measuring  instru- 
ment, we  want  to  damp  out  the  oscillations  of  the  pointer  about  its  correct 
position  as  quickly  as  possible,  but  too  much  damping  would  lead  to  a  very 
slow  response.  Let  us  assume  that  k  is  fixed  and  the  amount  of  damping 
varied.  When  the  damping  is  less  than  critical  (7  <  u>o),  the  characteristic 
time  of  response  is  the  relaxation  time  I/7,  which  of  course  decreases  as 
7  is  increased.  However,  when  7  >  luq,  the  characteristic  time  is  I/7-,  as 
we  noted  above.  It  is  not  hard  to  verify  that  as  7  increases,  7_  decreases, 
so  that  the  response  time  1/7-  increases  (see  Fig.  2.6).  Thus  the  shortest 
possible  response  time  is  obtained  by  choosing  7  =  uq,  that  is  for  critical 
damping. 
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Fig.  2.6 


2.6    Oscillator  under  Simple  Periodic  Force 

In  an  isolated  system  the  forces  are  functions  of  position  and  velocity,  but 
not  explicitly  of  the  time.  However,  we  are  often  interested  in  the  response 
of  an  oscillatory  system  to  an  applied  external  force,  which  is  given  as  a 
function  of  the  time,  F(t).  Then  we  have  to  deal  with  the  equation 


Now,  if  xi(t)  is  any  solution  of  this  equation,  and  xo(i)  is  a  solution  of 
the  corresponding  homogeneous  equation  (2.26)  for  the  unforced  oscillator, 
then,  as  one  can  easily  check  by  direct  substitution,  Xi(t)  +  Xo(t)  will  be 
another  solution  of  (2.34).  Hence  we  only  need  to  find  one  particular  so- 
lution of  this  equation.  Its  general  solution  is  then  obtained  by  adding  to 
this  particular  solution  the  general  solution  of  the  homogeneous  equation 
(2.26).  (This  will  be  the  general  solution  because  it  contains  two  arbitrary 
constants.) 

Let  us  consider  first  the  case  where  the  applied  force  is  periodic  in  time, 
with  the  simple  form 


where  F\  and  u>i  are  real  constants.  It  is  convenient  to  write  this  in  the 
form 


mi  +  Xx  +  kx  =  F(t). 


(2.34) 


F(t)  =  Fi  cosujit, 


(2.35) 


F(t)  =  Re(Fieiuit), 
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and  to  solve  first  the  equation  with  a  complex  force, 

mz  +  Xz  +  kz  =  Fieiuit.  (2.36) 

Then  the  real  part  x  of  this  solution  will  be  a  solution  of  the  equation  (2.34) 
with  the  force  (2.35). 

We  now  look  for  a  particular  solution  of  (2.36)  which  is  periodic  in  time 
with  the  same  period  as  the  applied  force, 

2  =  J41ei"1'  =  a1ei^lt-ei), 

where  A\  =  ai&~101  is  a  complex  constant.  Substituting  in  (2.36),  we  obtain 
an  equation  for  A\: 

{-mJi  +  iAwi  +  k)Ax  =  iq.  (2.37) 
Dividing  by  me"lfll  and  rearranging  the  terms  gives 

{ojI  +  2i7wi  -  w?)ai  =  —  ei£\ 
m 

where  7  and  loq  are  defined  as  before  by  (2.28).  Hence,  equating  real  and 
imaginary  parts, 

(cjq  —  Lt)\)a\  =  (Fi/m)  cos^i, 
27Wiai  =  (Fi /m)  sin  0i . 

The  amplitude  is  found  by  squaring  these  equations  and  adding,  to  give 

Fj/m 

0,1  ~       I,    9  9X9        ,9     9-  (2.6$) 

Dividing  one  equation  by  the  other  yields  the  phase: 

tan  fli  =    l1UJl  „.  (2.39) 
tol  -  uj\ 

If  F\  >  0,  the  correct  choice  between  the  two  solutions  for  0i  differing  by 
7r  is  the  one  lying  in  the  range  0  <  0i  <  ir. 

We  have  found  a  particular  solution  of  (2.36).  Its  real  part,  the  corre- 
sponding particular  solution  of  our  original  equation  (2.34),  is  simply 

x  =  ai  cos(wi£  -  0i).  (2.40) 
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The  general  solution,  as  we  discussed  above,  is  obtained  by  adding  to  this 
particular  solution  the  general  solution  of  the  corresponding  homogeneous 
equation,  namely  (2.29)  or  (2.31).  In  the  case  where  the  damping  is  less 
than  critical  (7  <  wo),  we  obtain 

x  =  a\  cos{u)it  -  61)  +  ae~Jt  cos(uit  -  6).  (2-41) 

Here,  of  course,  a\  and  81  are  given  by  (2.38)  and  (2.39),  but  a  and  9  are 
arbitrary  constants  to  be  fixed  by  the  initial  conditions. 

The  second  term  in  the  general  solution  (2.41),  which  represents  a  free 
oscillation,  dies  away  exponentially  with  time.  It  is  therefore  called  the 
transient.  After  a  long  time,  the  displacement  x  will  be  given  by  the  first 
term  of  (2.41),  that  is,  by  (2.40),  alone.  Thus,  no  matter  what  initial 
conditions  we  choose,  the  oscillations  are  ultimately  governed  solely  by  the 
external  force.  Note  that  their  period  is  the  period  of  the  applied  force,  not 
the  period  of  the  unforced  oscillator. 


Resonance 

The  amplitude  a>i  and  phase  8\  of  the  forced  oscillations  are  strongly  de- 
pendent on  the  angular  frequencies  uiq  and  ui± .  In  particular,  if  the  damping 
is  small,  the  amplitude  can  become  very  large  when  the  frequencies  are  al- 
most equal.  If  we  fix  7  and  the  forcing  frequency  u>i,  and  vary  the  oscillator 
frequency  u>q,  the  amplitude  is  a  maximum  when  u>a  =  u>i.  In  this  case,  we 
say  that  the  system  is  in  resonance.  At  resonance,  the  amplitude  is 

1      2m7u;i      Xu>i '  ^  ^ 

which  can  be  very  large  if  the  damping  constant  A  is  small.  If,  on  the 
other  hand,  we  fix  the  parameters  7  and  ujo  of  the  oscillator,  and  vary 
the  forcing  frequency  uji,  the  maximum  amplitude  actually  occurs  for  a 
frequency  slightly  lower  than  u>q,  namely 


^0  "  272 


However,  if  7  is  small,  this  does  not  differ  much  from  u>q.  Note  that  the 
natural  frequency  u>  of  the  oscillator  lies  between  this  resonant  frequency 
and  the  frequency  ojq  of  the  undamped  oscillator. 

Near  resonance,  the  dependence  of  the  amplitude  a\  on  the  forcing 
frequency  uj\  is  of  the  form  illustrated  in  Fig.  2.7(a).  The  width  of  the 
resonance,  that  is  the  range  of  frequencies  over  which  the  amplitude  is  large, 


Linear  Motion 


33 


CO, 


'0 


CO 


CO, 


0 


CO 


(a) 


(b) 


Fig.  2.7 


is  determined  by  7.  For,  the  amplitude  is  reduced  to  1/^/2  of  its  peak  value 
when  the  two  terms  in  the  denominator  of  (2.38)  become  roughly  equal, 
and  for  small  7  this  occurs  when  w\  =  ujq  ±  7.  Therefore,  7  is  called  the 
half-width  of  the  resonance.  Notice  the  inverse  relationship  between  the 
width  and  peak  amplitude:  the  narrower  the  resonance,  the  higher  is  its 
peak. 

The  quality  factor  Q  =  uJo/^l  provides  a  quantitative  measure  of  the 
sharpness  of  the  resonance  peak.  Indeed,  the  ratio  of  the  amplitude  at 
resonance,  given  by  (2.42),  to  the  amplitude  at  uii  =  0  (which  describes  the 
response  to  a  time-independent  force)  is  precisely  Q. 

The  phenomenon  of  resonance  occurs  with  any  oscillatory  system,  and 
is  of  great  practical  importance.  Since  quite  small  forces  can  set  up  large 
oscillations  if  the  frequencies  are  in  resonance,  great  care  must  be  taken  in 
the  design  of  any  mechanical  structure  to  avoid  this  possibility.  This  led  to 
problems  for  example  with  the  original  design  of  the  Millennium  Bridge  in 
London. 

The  constant  9\  specifies  the  phase  relation  between  the  applied  force 
and  the  induced  oscillations.  If  the  force  is  slowly  oscillating,  uji  is  small, 
and  81  en  0,  so  that  the  induced  oscillations  are  in  phase  with  the  force.  In 
that  case,  the  amplitude  (2.38)  is 


Tims  the  position  x  at  any  time  t,  given  by  (2.40),  is  approximately  the 


k  ' 
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equilibrium  position  under  the  force  Ficosujit  —  kx.  At  resonance,  the 
phase  shift  is  9\  =  ^n,  and  the  induced  oscillations  lag  behind  by  a  quarter 
period.  (The  variation  of  6\  with  uj\  is  shown  in  Fig.  2.7(b).)  For  very 
rapidly  oscillating  forces,  6\  fa  tt,  and  the  oscillations  are  almost  exactly 
out  of  phase.  In  this  limiting  case,  a\  fa  Fi/mojf  and  the  oscillations 
correspond  to  those  of  a  free  particle  under  the  applied  oscillatory  force. 
Note  that  the  value  of  the  damping  constant  is  important  only  in  the  region 
near  the  resonance. 

2.7    General  Periodic  Force 

The  solution  we  have  obtained  can  immediately  be  generalized  to  the  case 
where  the  applied  force  is  a  sum  of  periodic  terms, 

F(t)  =  Fie1"1'  +  F2eiuJ2t  +  •  •  •  =  ^  Freiav*.  (2.43) 

T 

We  can  always  ensure  that  this  force  is  real  by  including  with  each  term  its 
complex  conjugate,  so  it  is  not  really  necessary  to  write  Re  in  front  of  the 
sum.  (In  particular,  a  simple  periodic  force  may  be  written  as  a  sum  of  two 
terms  proportional  to  e1"'  and  e~lujt.)  Because  of  the  linearity  of  (2.34), 
the  corresponding  solution  is 

x  =       AreluA  +  transient,  (2.44) 

r 

where  each  Ar  is  related  to  the  corresponding  Fr  by  (2.37).  This  may  easily 
be  verified  by  direct  substitution  in  (2.34). 
In  particular,  let  us  take 

oo 

F(t)=  F^mUt'  (2-45) 

n=  —  oo 

where  n  runs  over  all  integers,  positive,  negative  and  zero.  Here  u>  represents 
a  fundamental  angular  frequency  and  the  frequencies  nui  are  its  harmonics. 
Each  term  in  the  sum  is  unchanged  by  the  replacement  t  — >  t  +  2tt/u>,  so 
F(t)  is  clearly  periodic  in  the  sense  that 

F(t  +  t)  =  F(t),    where    r  =  2tt/lj.  (2.46) 

In  fact,  any  periodic  force  with  period  r  can  be  expressed  in  the  form  (2.45), 
which  is  called  a  Fourier  series.  If  we  are  given  F(t)  it  is  easy  to  find  the 
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corresponding  coefficients  Fn.  For  one  easily  sees  that 


T  Jo 


m)tdt  =  5n 


(2.47) 


where  6nm  is  Kronecker's  delta  symbol  defined  by 


1  if  n  =  m, 
0    if  n  ^  m. 


(2.48) 


Thus  if  we  multiply  (2.45)  by  e  lmujt  and  integrate  over  a  period,  the  only 
term  in  the  sum  which  survives  is  the  one  with  n  =  rn.  Hence  we  obtain 


if 


F(t)e~ 


'dt. 


(2.49) 


A  simple  example  involves  a  force  with  'square-wave'  form. 


Example:  Square-wave  force 

Find  the  position  after  a  long  time  of  an  oscillator  subjected  to  a 
force  with  the  square-wave  form  shown  in  Fig.  2.8,  and  given  by 


F 


0 
—F 


F(t) 


Fig.  2.8 


F,      rr  <t<{r+\ 


t, 


-F,    (r  -\)t  <t<  rr, 


for  r  =  0,±1,±2, 


36 


Classical  Mechanics 


Since  uj  =  2tv/t,  we  find 

,r/2 


f  r 

T  Jr/2 

[!-(-!)»] 


F„  =  -  /      e-27ri"*/Tdt  -  -  /  e-2™*/rdt 


T 

mn ' 

Note  that  F„  =  0  for  all  even  values  of  n,  because  the  expression  in 
square  brackets  vanishes.  This  happens  because  in  addition  to  repeating 
after  the  period  r,  F(t)  repeats  with  opposite  sign  after  half  this  period: 
F(t  +  \t)  =  -F(t).  For  odd  values  of  n,  we  find  from  (2.37)  that  the 
amplitudes  An  are 

2F 

A„  =  T-2  o    2   i   o-  V  °dd)- 

The  position  of  the  oscillator  is  then  given  by 

oo 

x=   J2  A2r+iel(2r+1)uJt  +  transient. 

r=  —  oo 

After  a  long  time,  the  transient  can  be  ignored,  and  the  position  is  given 
by  the  sum  alone. 

Note  that  if  the  damping  is  small  and  ujq  is  close  to  one  of  the  values 
nuj,  then  one  pair  of  amplitudes,  An  and  A_n  will  be  much  larger  than 
the  rest,  and  the  oscillation  will  be  almost  simple  harmonic  motion  at  this 
frequency.  In  general,  for  large  n,  the  amplitudes  An  decrease  rapidly  in 
magnitude  as  n  increases,  so  that  we  can  obtain  a  good  approximation  to 
x  by  keeping  only  a  few  terms  of  the  series. 

In  this  way,  one  can  obtain  the  solution  to  (2.34)  for  an  arbitrary  pe- 
riodic force.  As  a  matter  of  fact,  this  kind  of  treatment  can  be  extended 
even  further,  by  replacing  the  sum  in  (2.45)  by  an  integral  over  all  pos- 
sible angular  frequencies  w.  Any  force  whatever,  subject  to  some  very 
general  mathematical  requirements,  can  be  written  in  this  form.  (This  is 
the  Fourier  integral  theorem.)  However,  we  shall  not  pursue  this  method, 
because  we  shall  find  in  the  next  section  an  alternative  method  of  solving 
the  problem,  which  is  often  simpler  to  use. 
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2.8    Impulsive  Forces;  the  Green's  Function  Method 

There  are  many  physical  situations,  for  example  collisions,  in  which  very 
large  forces  act  for  very  short  times.  Let  us  consider  a  force  F  acting  on 
a  particle  during  the  time  interval  At.  The  resulting  change  of  momentum 
of  the  particles  is 

/•t+At 

Ap  =  p(t  +  At)  -  p(t)  =  J        Fdt.  (2.50) 

The  quantity  on  the  right  is  called  the  impulse  I  delivered  to  the  particle. 
It  is  natural  to  consider  an  idealized  situation  in  which  the  time  interval 
At  tends  to  zero,  and  the  whole  of  the  impulse  I  is  delivered  to  the  particle 
instantaneously.  In  other  words,  we  let  At  — >  0  and  F  — >  oo  in  such  a 
way  that  the  impulse  /  remains  finite.  At  the  instant  when  the  impulse  is 
delivered  the  momentum  of  the  particle  changes  discontinuously.  Although 
in  reality  the  force  must  always  remain  finite,  this  is  a  good  description  of, 
for  example,  the  effect  of  a  sudden  blow. 

In  the  next  section,  we  shall  discuss  some  simple  collision  problems. 
Here,  we  shall  consider  the  effect  of  an  impulsive  force  on  an  oscillator.  Let 
us  suppose  first  that  the  oscillator  is  at  rest  at  its  equilibrium  position  x  =  0, 
and  that  at  time  t  =  0  it  experiences  a  blow  of  impulse  /.  Immediately 
after  the  blow,  its  position  is  still  x  =  0  (since  the  velocity  remains  finite, 
the  position  does  not  change  discontinuously),  while  its  velocity  is  vq  = 
I/m.  Inserting  these  initial  conditions  in  the  general  solution  (2.31)  for  the 
oscillatory  case  7  <  ujq,  we  find  that  the  position  is  given  by 

{0,  t<0, 
J 
 e-T*sinwt,    t  >  0. 
raw 

This  solution  is  illustrated  in  Fig.  2.9. 

Just  as  in  the  case  of  periodic  forces,  we  can  immediately  generalize  this 
result  to  a  sum  of  impulsive  forces.  If  the  oscillator  is  subjected  to  a  series 
of  blows,  of  impulse  Ir  at  time  tr,  its  position  may  be  found  by  adding 
together  the  corresponding  set  of  solutions  of  the  form  (2.51).  We  obtain 
in  this  way 

x(t)  =  ^  G(t  -  tr)Ir  +  transient,  (2.52) 


(2.51) 
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G(t) 


t 


Fig.  2.9 


where 


G(t 


0 


0,  t  <  t', 

_e-7(*-*')  sin w(t-t'),  i>f. 


(2.53) 


The  function  G(t  —  t')  is  called  the  Green's  function  of  the  oscillator.  It 
represents  the  response  to  a  blow  of  unit  impulse  delivered  at  the  time  t'. 

We  can  now  use  this  result  to  obtain  a  solution  to  (2.34)  for  an  arbitrary 
force  F(t).  The  method  is,  in  a  sense,  complementary  to  Fourier's.  Instead 
of  expressing  F(t)  as  a  sum  of  periodic  forces,  we  express  it  as  a  sum  of 
impulsive  forces.  If  we  divide  the  time  scale  into  small  intervals  At,  then  the 
impulse  delivered  to  the  particle  between  t  and  t  +  At  will  be  approximately 
F(t)At.  Moreover,  if  At  is  short  enough,  it  will  be  a  good  approximation 
to  suppose  that  the  whole  of  this  impulse  is  delivered  instantaneously  at 
the  time  t.  Thus  the  solution  should  be  given  approximately  by  (2.52), 
in  which  the  interval  between  tr  and  tr+i  is  At  and  Ir  =  F(tr)At.  The 
approximation  will  improve  as  At  is  reduced,  and  become  exact  in  the  limit 
At  — >  0.  In  this  limit,  the  sum  in  (2.52)  goes  over  into  an  integral,  and  we 
finally  obtain 


The  lower  limit  to  is  the  initial  time  at  which  the  initial  conditions  deter- 
mining the  arbitrary  constants  in  the  transient  term  are  to  be  imposed. 
The  upper  limit  may  be  taken  to  be  t  because  G(t  —  t')  vanishes  for  t'  >  t. 
(In  other  words,  blows  subsequent  to  t  do  not  affect  the  position  at  t.) 

The  solution  (2.54)  is  very  useful  in  practice,  because  it  is  an  explicit 
solution  requiring  the  evaluation  of  only  one  integral.   It  is  particularly 


(2.54) 
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well  adapted  to  numerical  solution  of  the  problem  when  F(t)  is  known 
numerically. 

Example:  Step-function  force 

An  oscillator  is  initially  at  rest  and  is  subject  to  a  step-function 
force, 


F(t) 


0,  t  <  0, 
F,    *  >0, 


with  F  constant.  What  is  the  position  at  time  t  >  0? 

To  perform  the  integration  in  (2.54),  it  is  convenient  to  write  the 
sine  function  as  the  imaginary  part  of  an  exponential: 


x{t) 


f  G(t  -  t')Fdt'  =  —  f  Im  U-i+^-t')]  dt> 
Jo  muj  JQ       I  1 


^Im 

mco 


-■yt-\-iu)t 


F 


mu>{^2  +  u2 
F 


7  —  ico 

Im  [(7  +  iw)  (1  -  e-T*+i"*)] 


[uj(1  —  e  7t  cosojt)  —  je  7tsmojt], 


using  +  oj2  =  u)q.  Note  that  for  large  t,  this  tends  as  it  must  to  the 
displacement  for  a  constant  force  F,  namely  x  =  F/muify 


2.9    Collision  Problems 

So  far,  we  have  been  discussing  the  motion  of  a  single  particle  under  a 
known  external  force.  We  now  turn  to  the  problem  of  an  isolated  system  of 
two  bodies  moving  in  one  dimension  under  a  mutual  force  F  which  is  given 
in  terms  of  their  positions  and  velocities.  The  equations  of  motion  are  then 

m\X\  =  F,       m,2X2  =  —F.  (2.55) 

As  we  have  already  seen  in  Chapter  1,  the  sum  of  these  two  equations  leads 
to  the  law  of  conservation  of  momentum, 


Pi  +  P2  =  P  =  constant. 


(2.56) 
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According  to  the  relativity  principle,  F  must  be  a  function  only  of  the 
relative  distance  and  relative  velocity, 


Xl  -  X2 


and 


Xl  —  X2- 


When  it  is  a  function  only  of  x  the  force  is  conservative,  and  we  can  intro- 
duce a  potential  energy  function  V(x)  as  before.  The  law  of  conservation 
of  energy  then  takes  the  form 


T  +  V  =  E  =  constant, 


(2.57) 


with 


T 


^m\xl  +  ^m2X2- 


This  may  easily  be  verified  by  differentiating  (2.57)  and  using  (2.55)  and 
the  relation  between  V  and  F,  which  is  still  (2.7). 

We  are  particularly  interested  in  collision  problems,  in  which  the  force 
is  generally  small  except  when  the  bodies  are  very  close  together.  The 
potential  energy  function  is  then  a  constant  (say  zero)  for  large  values 
of  x,  and  rises  very  sharply  for  small  values.  (See  Fig.  2.10(a).)  An  ideal 
impulsive  conservative  force  corresponds  to  a  potential  energy  function  with 


(a) 


(b) 


Fig.  2.10 

a  discontinuity  or  step  (Fig.  2.10(b)).  So  long  as  the  initial  kinetic  energy 
is  less  than  the  height  of  the  step,  the  bodies  will  bounce  off  one  another. 
(If  the  kinetic  energy  exceeds  this  value,  one  body  will  pass  through  the 
other.  For  completely  impenetrable  bodies,  the  potential  step  would  have 
to  be  infinitely  high.) 

From  the  law  of  conservation  of  energy,  we  know  that  the  final  value  of 
the  kinetic  energy,  when  the  bodies  are  again  far  apart,  is  the  same  as  the 
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initial  value  well  before  the  collision.  If  we  denote  the  initial  velocities  by 
Wi,«2  and  the  final  velocities  by  V\,V2,  then  (see  Fig.  2.11) 


initial  final 


Fig.  2.11 


\m\v\  +  \miv\  =  \m\-u\  +  \rri2u\.  (2.58) 

Collisions  of  this  type,  in  which  there  is  no  loss  of  kinetic  energy,  are  known 
as  elastic  collisions.  They  are  typical  of  very  hard  bodies,  like  billiard  balls. 

We  saw  earlier  that  a  particle  which  bounces  back  from  a  potential 
barrier  emerges  with  velocity  equal  (except  for  sign)  to  its  initial  velocity. 
We  can  easily  derive  a  similar  result  for  the  case  of  two-body  collisions.  To 
do  this  we  write  the  energy  and  momentum  conservation  equations  in  the 
forms 

m\v\  —  miu\  =  rri2u\  —  rn<2.v\, 
m\V\  —  m\U\  =  7712M2  —  W-2V2- 

We  can  then  divide  the  first  equation  by  the  second,  to  obtain 

Ml  +  U\  =  «2  +  V2, 

or,  equivalently, 

v2-vi=ui-  u2.  (2.59) 

This  shows  that  the  relative  velocity  is  just  reversed  by  the  collision. 

Equation  (2.59)  and  the  momentum  conservation  equation  (2.56)  may 
be  solved  for  the  final  velocities  in  terms  of  the  initial  ones.  In  particular, 
if  the  second  body  is  initially  at  rest  (u2  =  0),  the  result  is 

mi  -  m2  2m! 

vi  =  «i,        v2  =  Ml-  (2.60) 

mi  +  m2  mi  +  m2 

Note  that  if  the  masses  are  equal,  then  the  first  body  is  brought  to  rest 
by  the  collision,  and  its  velocity  transferred  to  the  second  body.  If  mi  > 
m-2,  the  first  body  continues  in  the  same  direction,  with  reduced  velocity, 
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whereas  if  mi  <  rri2,  it  rebounds  in  the  opposite  direction.  In  the  limit 
where  m2  is  much  larger  than  mi,  we  obtain  v\  =  — «i,  which  agrees  with 
the  previous  result  for  a  particle  rebounding  from  a  fixed  potential  barrier. 

Inelastic  collisions 

In  practice  there  is  generally  some  loss  of  energy  in  a  collision,  for  instance 
in  the  form  of  heat.  In  that  case,  the  relative  velocity  will  be  reduced  in 
magnitude  by  the  collision.  We  define  the  coefficient  of  restitution  e  in  a 
particular  collision  by 

V2  -  vi  =  e(ui  -  u2).  (2-61) 

The  usefulness  of  this  quantity  derives  from  the  experimental  fact  that,  for 
any  two  given  bodies,  e  is  approximately  a  constant  for  a  wide  range  of 
velocities. 

The  final  velocities  may  again  be  found  from  (2.61)  and  the  momentum 
conservation  equation  (2.56).  When  u2  =  0,  we  get 

mi  -  em2  (1  +  e)mi 

Vi  =   ui,        V2  =   «i-  (2.62) 

mi  +  m2  mi  +  m2 

For  e  =  1  these  reduce  to  the  relations  (2.60)  for  the  case  of  an  elastic 
collision.  At  the  other  extreme,  we  have  the  case  of  very  soft  bodies  that 
stick  together  on  impact.  Then  e  =  0,  and  the  collision  is  called  perfectly 
inelastic  (or,  more  accurately  perhaps,  perfectly  plastic). 

The  energy  loss  in  an  inelastic  collision  is  easily  evaluated.  The  initial 
kinetic  energy  is  T  =  \m\u\,  while  the  final  kinetic  energy  is  T'  =  \m\v\  + 
im2u2.  Substituting  for  the  final  velocities  from  (2.62)  we  find,  after  a 
simple  calculation,  that  the  fractional  loss  of  kinetic  energy  is 

T~^f  =  (1-e>2.  (2.63) 
T  m1+m2 

This  shows,  incidentally,  that  e  must  always  be  less  than  one,  unless  some 
energy  is  released,  as  for  example  by  an  explosion. 


2.10  Summary 

The  conservation  laws  for  energy  and  momentum  are  among  the  most  im- 
portant consequences  of  Newton's  laws.  They  are  valid  quite  generally  for 
an  isolated  system,  as  we  shall  see  in  later  chapters.  For  a  system  which  is 
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only  part  of  a  larger  system,  and  subject  to  external  forces,  there  may  be 
some  transfer  of  energy  and  momentum  from  the  system  to  its  surroundings 
(or  vice  versa).  For  a  single  particle  moving  on  a  straight  line,  the  total 
(kinetic  plus  potential)  energy  is  conserved  if  the  force  is  a  function  only 
of  x.  If  the  force  is  velocity-dependent,  however,  there  will  be  some  loss  of 
energy  to  the  surroundings. 

The  importance  of  the  harmonic  oscillator,  treated  in  detail  in  this  chap- 
ter, lies  in  the  fact  that  any  system  with  one  degree  of  freedom  behaves  like 
a  harmonic  oscillator  when  sufficiently  near  to  a  point  of  stable  equilib- 
rium. The  methods  we  have  discussed  for  solving  the  oscillator  equation 
can  therefore  be  applied  to  a  great  variety  of  problems.  In  particular,  the 
phenomenon  of  resonance  can  occur  in  any  system  subjected  to  periodic 
forces.  It  occurs  when  there  is  a  natural  frequency  of  oscillation  close  to 
the  forcing  frequency,  and  leads,  if  the  damping  is  small,  to  oscillations  of 
very  large  amplitude. 


Problems 

1.  A  harmonic  oscillator  of  angular  frequency  2s_1  is  initially  at  x  = 
—3  m,  with  x  =  8ms"1.  Write  the  solution  in  each  of  the  three  forms 
(2.19),  (2.20)  and  (2.23).  Find  the  first  time  at  which  x  =  0,  and  the 
first  at  which  x  =  0.  (A  sketch  of  the  solution  may  help.) 

2.  A  weight  of  mass  m  is  hung  from  the  end  of  a  spring  which  provides 
a  restoring  force  equal  to  k  times  its  extension.  The  weight  is  released 
from  rest  with  the  spring  unextended.  Find  its  position  as  a  function 
of  time,  assuming  negligible  damping. 

3.  When  a  mass  is  suspended  from  a  spring,  the  equilibrium  length  is 
increased  by  50  mm.  The  mass  is  then  given  a  blow  which  starts  it 
moving  vertically  at  200mms_1.  Find  the  period  and  amplitude  of  the 
resulting  oscillations,  assuming  negligible  damping. 

4.  A  pendulum  of  period  2  s  is  released  from  rest  at  an  inclination  of  5°  to 
the  (downward)  vertical.  What  is  its  angular  velocity  when  it  reaches 
the  vertical?  When  it  first  returns  to  its  starting  point,  it  is  given 
an  impulsive  blow  towards  the  vertical  that  increases  the  amplitude  of 
swing  to  10°.  Find  its  subsequent  angular  position  as  a  function  of 
time. 

5.  Write  down  the  potential  energy  function  corresponding  to  the  force 
—GMm/x2  experienced  by  a  particle  of  mass  m  at  a  distance  x  from  a 
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planet  of  mass  M  m).  The  particle  is  released  from  rest  at  a  distance 
a  from  the  centre  of  the  planet,  whose  radius  is  R,  and  falls  under 
gravity.  What  is  its  speed  when  it  strikes  the  surface?  Evaluate  this 
speed  for  a  particle  falling  to  the  Earth  from  a  =  2R.  (Use  R  =  6400 
km,  and  GM/R2  =g  =  10ms"2.) 

6.  A  particle  of  mass  m  moves  under  a  force  F  =  —cx3,  where  c  is  a 
positive  constant.  Find  the  potential  energy  function.  If  the  particle 
starts  from  rest  at  x  =  —a,  what  is  its  velocity  when  it  reaches  x  =  0? 
Where  in  the  subsequent  motion  does  it  instantaneously  come  to  rest? 

7.  A  particle  of  mass  m  moves  (in  the  region  x  >  0)  under  a  force  F  = 
—kx+c/x,  where  k  and  c  are  positive  constants.  Find  the  corresponding 
potential  energy  function.  Determine  the  position  of  equilibrium,  and 
the  frequency  of  small  oscillations  about  it. 

8.  A  particle  of  mass  m  has  the  potential  energy  function  V(x)  =  mk\x\, 
where  A;  is  a  positive  constant.  What  is  the  force  when  x  >  0,  and  when 
x  <  0?  Sketch  the  function  V  and  describe  the  motion.  If  the  particle 
starts  from  rest  at  x  =  —a,  find  the  time  it  takes  to  reach  x  =  a. 

9.  A  particle  of  mass  m  moves  under  a  conservative  force  with  potential 
energy  function  given  by 


where  a  and  k  are  constants,  and  a  >  0.  What  is  the  force  on  the 
particle?  Sketch  the  function  V,  for  both  cases  k  >  0  and  k  <  0,  and 
describe  the  possible  types  of  motion. 

10.  The  particle  of  Problem  9  with  k  =  —rrwo2  <  0  is  initially  in  the  region 
x  <  —a,  moving  to  the  right  with  velocity  v.  When  it  emerges  into  the 
region  x  >  a,  will  it  do  so  earlier  or  later  than  if  it  were  moving  freely 
under  no  force?  Find  an  expression  for  the  time  difference.  (To  do  the 
required  integral,  use  a  substitution  of  the  form  x  =  constant  x  sin6>.) 

11.  The  potential  energy  function  of  a  particle  of  mass  m  is  V  =  —  \c(x2  — 
a2)2,  where  c  and  a  are  positive  constants.  Sketch  this  function,  and 
describe  the  possible  types  of  motion  in  the  three  cases  (a)  E  >  0,  (b) 
E  <  -ica4,  and  (c)  -|ca4  <  E  <  0. 

12.  *The  potential  energy  function  of  a  particle  of  mass  rn  is  V  =  cx/(x2  + 
a2),  where  c  and  a  are  positive  constants.  Sketch  V  as  a  function  of 
x.  Find  the  position  of  stable  equilibrium,  and  the  period  of  small 
oscillations  about  it.    Given  that  the  particle  starts  from  this  point 


^k(a2  -  x2)  for  \x\  <  a, 
0  for  \x\  >  a, 
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with  velocity  v,  find  the  ranges  of  values  of  v  for  which  it  (a)  oscillates, 
(b)  escapes  to  —  oo,  and  (c)  escapes  to  +00. 

13.  *A  particle  falling  under  gravity  is  subject  to  a  retarding  force  propor- 
tional to  its  velocity.  Find  its  position  as  a  function  of  time,  if  it  starts 
from  rest,  and  show  that  it  will  eventually  reach  a  terminal  velocity. 
[The  equation  of  motion  can  be  integrated  once  to  give,  with  a  suitable 
choice  of  origin  and  definition  of  7  (differing  from  (2.28)  by  a  factor  of 
2),  2  +  72  =  —  gt.  To  integrate  again,  use  an  integrating  factor,  i.e. 
a  function  f(t)  such  that  when  the  equation  is  multiplied  by  f(t)  the 
left-hand  side  becomes  an  exact  derivative,  in  fact  the  derivative  of  2/. 
The  final  stage  requires  an  integration  by  parts.] 

14.  The  terminal  speed  of  the  particle  in  Problem  13  is  50ms-1.  Find  the 
time  it  takes  to  reach  a  speed  of  40ms-1,  and  the  distance  it  has  fallen 
in  that  time.  (Take  g  =  10ms-2.) 

15.  *A  particle  moves  vertically  under  gravity  and  a  retarding  force  pro- 
portional to  the  square  of  its  velocity.  (This  is  more  appropriate  than 
a  linear  relation  for  larger  particles  —  see  Problem  8.10.)  If  v  is  its  up- 
ward or  downward  speed,  show  that  v  =  ^fg  —  kv2,  respectively,  where  k 
is  a  constant.  If  the  particle  is  moving  upwards,  show  that  its  position 
at  time  t  is  given  by  2  =  20  +  (1/fc)  \ncos[\fgk  (to  —  t)],  where  zq  and 
to  are  integration  constants.  If  its  initial  velocity  at  t  =  0  is  u,  find 
the  time  at  which  it  comes  instantaneously  to  rest,  and  its  height  then. 
[Note  that  In  always  denotes  the  natural  logarithm:  In  a;  =  loge  x.  You 
may  find  the  identity  In  cos  x  =  —  |  ln(l  +  tan2  a;)  useful.] 

16.  *Show  that  if  the  particle  of  the  previous  question  falls  from  rest  its 
speed  after  a  time  t  is  given  by  v  =  y ' g/k  t&i\}i(^fgk i) .  What  is  its 
limiting  speed?  How  long  does  it  take  to  hit  the  ground  if  dropped 
from  height  hi 

17.  The  pendulum  described  in  §2.1  is  released  from  rest  at  an  angle  9o  to 
the  downward  vertical.  Find  its  angular  velocity  as  a  function  of  9,  and 
the  period  of  small  oscillations  about  the  position  of  stable  equilibrium. 
Write  down  the  solution  for  6  as  a  function  of  time,  assuming  that  6q 
is  small. 

18.  *For  the  pendulum  described  in  §2.1,  find  the  equation  of  motion  for 
small  displacements  from  the  position  of  uristable  equilibrium,  9  =  it. 
Show  that  if  it  is  released  from  rest  at  a  small  angle  to  the  upward 
vertical,  then  the  time  taken  for  the  angular  displacement  to  increase  by 
a  factor  of  10  will  be  approximately  y/l/gln.20.  Evaluate  this  time  for 
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a  pendulum  of  period  2  s,  and  find  the  angular  velocity  of  the  pendulum 
when  it  reaches  the  downward  vertical. 

19.  *The  particle  of  Problem  11  starts  from  rest  at  x  =  —a,  and  is  given  a 
small  push  to  start  it  moving  to  the  right.  What  is  its  velocity  when 
it  reaches  the  point  x?  Given  that  t  =  0  is  the  instant  when  it  reaches 
x  =  0,  find  its  position  as  a  function  of  time.  (Assume  that  the  push 
has  negligible  effect  on  its  energy.) 

20.  *A  particle  of  mass  m  moves  in  the  region  x  >  0  under  the  force 
F  =  —rnw'2(x  —  a4/x3),  where  u>  and  a  are  constants.  Sketch  the 
potential  energy  function.  Find  the  position  of  equilibrium,  and  the 
period  of  small  oscillations  about  it.  The  particle  starts  from  this  point 
with  velocity  v.  Find  the  limiting  values  of  x  in  the  subsequent  motion. 
Show  that  the  period  of  oscillation  is  independent  of  v.  (To  do  the 
integration,  transform  to  the  variable  y  =  x  ,  then  add  a  constant  to 
'complete  the  square',  and  finally  use  a  trigonometric  substitution.) 

21.  Repeat  the  calculation  of  Problem  2  assuming  that  the  system  is  crit- 
ically damped.  Given  that  the  final  position  of  equilibrium  is  0.4  m 
below  the  point  of  release,  find  how  close  to  the  equilibrium  position 
the  particle  is  after  1  s. 

22.  *A  pendulum  whose  period  in  a  vacuum  is  1  s  is  placed  in  a  resistive 
medium.  Its  amplitude  on  each  swing  is  observed  to  be  half  that  on  the 
previous  swing.  What  is  its  new  period?  Given  that  the  pendulum  bob 
is  of  mass  0.1  kg,  and  is  subjected  to  a  periodic  force  of  amplitude  0.02  N 
and  period  1  s,  find  the  angular  amplitude  of  the  resulting  forced  oscil- 
lation. Compare  your  answer  with  the  angular  deviation  that  would  be 
induced  by  a  constant  force  of  this  magnitude  in  a  pendulum  at  rest. 

23.  Write  down  the  solution  to  the  oscillator  equation  for  the  case  u>q  >  7 
if  the  oscillator  starts  from  x  =  0  with  velocity  v.  Show  that,  as  ujq  is 
reduced  to  the  critical  value  7,  the  solution  tends  to  the  corresponding 
solution  for  the  critically  damped  oscillator. 

24.  *Solve  the  problem  of  an  oscillator  under  a  simple  periodic  force  (turned 
on  at  t  =  0)  by  the  Green's  function  method,  and  verify  that  it  repro- 
duces the  solution  of  §2.6.  [Assume  that  the  damping  is  less  than 
critical.  To  do  the  integral,  write  svaujt  as  (elujt  —  e_1"*)/2i.] 

25.  *For  an  oscillator  under  a  periodic  force  F(t)  =  F\  coswii,  calculate  the 
power,  the  rate  at  which  the  force  does  work.  (Neglect  the  transient.) 
Show  that  the  average  power  is  P  =  mjutfal,  and  hence  verify  that  it 
is  equal  to  the  average  rate  at  which  energy  is  dissipated  against  the 
resistive  force.  Show  that  the  power  P  is  a  maximum,  as  a  function 
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of  u)\,  at  uji  =  ujo,  and  find  the  values  of  uj\  for  which  it  has  half  its 
maximum  value. 

26.  *Find  the  average  value  E  of  the  total  energy  of  an  oscillator  under  a 
periodic  force.  If  W  is  the  work  done  against  friction  in  one  period, 
show  that  when  u>\  =  ujq  the  ratio  W/ E  is  related  to  the  quality  factor 
Q  by  W/E  =  2-k/Q. 

27.  Three  perfectly  elastic  bodies  of  masses  5  kg,  1  kg,  5  kg  are  arranged  in 
that  order  on  a  straight  line,  and  are  free  to  move  along  it.  Initially, 
the  middle  one  is  moving  with  velocity  27ms_1,  and  the  others  are  at 
rest.  Find  how  many  collisions  take  place  in  the  subsequent  motion, 
and  verify  that  the  final  value  of  the  kinetic  energy  is  equal  to  the  initial 
value. 

28.  A  ball  is  dropped  from  height  h  and  bounces.  The  coefficient  of  resti- 
tution at  each  bounce  is  e.  Find  the  velocity  immediately  after  the 
first  bounce,  and  immediately  after  the  nth  bounce.  Show  that  the  ball 
finally  comes  to  rest  after  a  time 

1  +  e 
1  -  e 

29.  *An  oscillator  with  free  period  r  is  critically  damped  and  subjected  to 
a  force  with  the  'saw-tooth'  form 

F(t)  =  c(t  -  nr)       for       (n  -  \)t  <t<  (n  +  §)r, 

for  each  integer  n  (c  is  a  constant).  Find  the  ratios  of  the  amplitudes 
an  of  oscillation  at  the  angular  frequencies  2-kti/t. 

30.  *A  particle  moving  under  a  conservative  force  oscillates  between  x\  and 
x2  ■  Show  that  the  period  of  oscillation  is 

T  =  2JX1  ^2[V(x2)n-V(x)]dX- 

In  particular,  if  V  =  ^moj^x2  —  foe4),  show  that  the  period  for  oscilla- 
tions of  amplitude  a  is 

_  _2_  r  dx  

J -a  Va2  -  x2^l  -  b(a2  +  x2) ' 

Using  the  binomial  theorem  to  expand  in  powers  of  b,  and  the  substi- 
tution x  =  as'mO,  show  that  for  small  amplitude  the  period  is  approx- 
imately t  «  27r(l  +  jba2)/u)0. 
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31.  Use  the  result  of  the  preceding  question  to  obtain  an  estimate  of  the 
correction  factor  to  the  period  of  a  simple  pendulum  when  its  angular 
amplitude  is  30°.  (Write  cos6>  «  1  -  ±6>2  +  ^6»4.) 

32.  *Find  the  Green's  function  of  an  oscillator  in  the  case  7  >  u>o-  Use  it 
to  solve  the  problem  of  an  oscillator  that  is  initially  in  equilibrium,  and 
is  subjected  from  t  =  0  to  a  force  increasing  linearly  with  time,  F  =  ct. 


Chapter  3 

Energy  and  Angular  Momentum 


In  this  chapter,  we  generalize  the  discussion  of  Chapter  2  to  the  case  of 
motion  in  two  or  three  dimensions.  Throughout  this  chapter,  we  shall  be 
concerned  with  the  problem  of  a  particle  moving  under  a  known  external 
force  F. 

3.1    Energy;  Conservative  Forces 

The  kinetic  energy  of  a  particle  of  mass  m  free  to  move  in  three  dimensions 
is  defined  to  be 

T  =±mr2  =  \m{x2 +y2 +z2).  (3.1) 

The  rate  of  change  of  the  kinetic  energy  is,  therefore, 

T  =  m(xx  +  yy  +  zz)  =  mr  ■  r  =  r  ■  F,  (3.2) 

by  the  equation  of  motion  (1.1).  The  change  in  kinetic  energy  in  a  time 
interval  At  during  which  the  particle  moves  a  (vector)  distance  dr  is  then 

AT  =  AW,  (3.3) 

with 

AW  =  F  ■  Ar  =  FxAx  +  FyAy  +  Fzdz.  (3.4) 

This  is  the  three-dimensional  expression  for  the  work  done  by  the  force  F 
in  the  displacement  dr.  Note  that  it  is  equal  to  the  distance  travelled,  |dr|, 
multiplied  by  the  component  of  F  in  the  direction  of  the  displacement. 


49 


50 


Classical  Mechanics 


One  might  think  at  first  sight  that  a  conservative  force  in  three  dimen- 
sions should  be  defined  to  be  a  force  F(r)  depending  only  on  the  position 
r  of  the  particle.  However,  this  is  not  sufficient  to  ensure  the  existence  of  a 
law  of  conservation  of  energy,  which  is  the  essential  feature  of  a  conservative 
force.  We  require  that 

T  +  V  =  E  =  constant,  (3.5) 

where  T  is  given  by  (3.1),  and  the  potential  energy  V  is  a  function  of 
position,  V(r).  Now,  the  rate  of  change  of  the  potential  energy  is 

•  ,  N     dV  .     dV  .     dV  . 
V(r)  =  —x  +  —y  +  —z, 
ox        oy  oz 

or,  in  terms  of  the  gradient  of  V, 

.dV      .dV  ,dV 

ox       oy  oz 

(see  (A.21)),  by 

V  =  r  ■  VV.  (3.6) 
Tims,  differentiating  (3.5),  and  using  (3.2)  and  (3.6)  for  T  and  V,  we  obtain 

r-{F  +  W)  =  0. 

Since  this  must  hold  for  any  value  of  the  velocity  r  of  the  particle,  the 
quantity  in  parentheses  must  vanish,  i.  e. 

F(r)  =  -W(r).  (3.7) 

This  is  the  analogue  for  the  three-dimensional  case  of  (2.7).  In  terms  of 
components,  it  reads 

_        dV  dV  dV 

Fx--te>  z""a7-  (3-8) 

For  example,  suppose  that  V  has  the  form 

V=  \kr2  =  \k{x2 +  y2  +  z2),  (3.9) 
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which  describes  a  three-dimensional  harmonic  oscillator  (discussed  in  more 
detail  in  Chapter  4).  Then  (3.8)  yields 

F  =  (—kx,  —ky,  —kz)  =  —kr.  (3.10) 

It  is  not  hard  to  find  the  necessary  and  sufficient  condition  on  the  force 
F(r)  to  ensure  the  existence  of  a  potential  energy  function  V(r)  satisfying 
(3.7).  Any  vector  function  F(r)  of  the  form  (3.7)  obeys  the  relation 

VAF  =  0,  (3.11) 

that  is,  its  curl  vanishes  (see  (A. 26)).  For  instance,  the  z  component  of 
(3.11)  is 

dx  dy 

and  this  is  true  because  of  (3.8)  and  the  symmetry  of  the  second  partial 
derivative: 

d2V  _  d2V 
dx  dy     dy  dx 

Thus  (3.11)  is  a  necessary  condition  for  the  force  F (r)  to  be  conservative. 
It  can  be  shown  (see  Appendix  A)  that  it  is  also  a  sufficient  condition.  More 
specifically,  if  the  force  F(r)  satisfies  this  condition,  then  the  work  done  by 
the  force  in  a  displacement  from  r0  to  r  is  independent  of  the  path  chosen 
between  these  points.  Therefore,  we  can  define  the  potential  energy  V(r) 
in  terms  of  the  force  by  a  relation  analogous  to  (2.5). 

3.2  Projectiles 

As  a  first  example  of  motion  in  two  dimensions  let  us  consider  a  projectile 
moving  under  gravity. 

The  equation  of  motion,  assuming  that  atmospheric  friction  is  negligible, 

is 

mr  =  mg,  (3-12) 

where  g  is  the  acceleration  due  to  gravity,  a  vector  of  constant  magnitude 
g  =  9.81  ms-2  directed  in  the  downward  vertical  direction.  (We  ignore  the 
small  effects  due  to  the  Earth's  rotation,  which  will  be  discussed  in  Chapter 
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5,  as  well  as  the  variation  of  g  with  height  —  see  Chapter  6.)  If  we  choose 
the  z-axis  vertically  upwards,  then  g  =  —gk,  so  that  (3.12)  takes  the  form 

i  =  0,       y  =  0,       z  =  -g.  (3.13) 

If  we  choose  the  x-  and  y-ax.es  so  that  initially  the  particle  is  moving  in 
the  icz-plane,  then  it  will  do  so  always.  Thus  the  motion  is  effectively 
two-dimensional.  The  solutions  of  (3.13)  are 

x  =  a  +  bt,       z  =  c  +  dt  —  \gt2 , 

where  a,  b,  c,  d  are  constants  to  be  fixed  by  the  initial  conditions. 

Example:  Projectile  range 

A  projectile  is  launched  from  the  surface  of  the  Earth  (assumed 
flat)  with  velocity  v  at  an  angle  a  to  the  horizontal.  What  is  its 
range? 

We  may  choose  the  point  of  launch  as  the  origin,  and  the  time  of 
launch  as  t  =  0.  At  that  instant, 

x  =  vcosa,        i  =  u  sin  a. 

Thus  the  solution  with  the  required  initial  conditions  is 

x  =  vtcosa,       z  =  vt  sin  a  —  \gt2.  (3-14) 

(The  trajectory  in  the  icz-plane  is  obtained  by  eliminating  t  between 
these  equations,  and  is  easily  seen  to  be  a  parabola.  However,  the  explicit 
equation  is  not  needed  here.)  The  projectile  will  hit  the  ground  when 
z  =  0  again,  i.e.,  at  the  time  t  =  2(vsma)/g.  Substituting  into  the  first 
of  Eqs.  (3.14)  yields  for  the  range 

x  =  2(v2  cos  a  sin  a)/(?.  (3.15) 

It  is  interesting  to  note  that  to  maximize  the  range  we  should  choose 
a  =  it/4;  the  maximum  value  is  :cmax  =  v2/g. 

Finally,  let  us  consider  the  much  less  trivial  problem  of  a  projectile 
subject  also  to  a  resistive  force,  such  as  atmospheric  drag.  As  in  Chapter  2, 
we  assume  that  the  retarding  force  is  proportional  to  the  velocity,  so  that 
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the  equation  of  motion  becomes 

mr  =  —Xr  +  mg.  (3.16) 

The  actual  dependence  of  atmospheric  drag  on  velocity  is  definitely  non- 
linear, but  this  equation  nevertheless  gives  a  reasonable  qualitative  picture 
of  the  motion. 

Defining  7  =  A/m,  we  may  write  (3.16)  as 

x  =  —jx,        z  =  —jz  —  g. 

These  equations  may  be  integrated  by  the  use  of  integrating  factors  (see 
Chapter  2,  Problem  13).  If  the  initial  velocity  is  v  =  (u,  0,  w),  the  required 
solution  is 

*=^(l-e-*),  *=(»+4)(l-e-*)-£. 
7  V7      72/  7 

Eliminating  t  between  these  two  equations  yields  the  equation  of  the  tra- 
jectory, 

x=T!!±±x  +  ^(i-TL).  (3.i7) 

A  distinctive  feature  of  this  equation  is  that,  even  if  the  launch  point  is  far 
above  the  ground,  there  is  a  maximum  value  of  x.  The  projectile  can  never 
reach  beyond  x  =  u/j,  and  as  it  approaches  that  value,  z  — >  —  00.  Thus 
the  trajectory  ends  in  a  near- vertical  drop,  when  all  the  forward  momentum 
has  been  spent. 

3.3    Moments;  Angular  Momentum 

The  moment  about  the  origin  of  a  force  F  acting  on  a  particle  at  position 
v  is  defined  to  be  the  vector  product 

G  =  r  A  F.  (3.18) 

The  components  of  the  vector  G  are  the  moments  about  the  x-,  y-  and 
z-axes, 

Gx  =  yFz  —  zFy,       Gy  =  zFx  —  xFz:       Gz  =  xFy  yFx. 


The  direction  of  the  vector  G  is  that  of  the  normal  to  the  plane  of  r  and 
F.  It  may  be  regarded  as  defining  the  axis  about  which  the  force  tends  to 
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rotate  the  particle.  The  magnitude  of  G  is 

G  =  rFsm6  =  bF, 

where  9  is  the  angle  between  r  and  F,  and  b  is  the  perpendicular  distance 
from  the  origin  to  the  line  of  the  force.  (See  Fig.  3.1.)  Moments  of  forces 


Fig.  3.1 

play  a  particularly  important  role  in  the  dynamics  of  rigid  bodies,  as  we 
shall  see  in  Chapters  8  and  9. 

Correspondingly,  we  define  the  vector  angular  momentum  (sometimes 
called  moment  of  momentum)  about  the  origin  of  a  particle  at  position  r, 
and  moving  with  momentum  p,  to  be 

J  =  rAp  =  mrAr.  (3.19) 

Its  components,  the  angular  momenta  about  the  x-,  y-  and  z-axes,  are 

Jx  =  m(yz  —  zy),       Jy  =  m(zx  —  xz),       Jz  =  m(xy  —  yx). 

The  momentum  p  is  often  called  linear  momentum  when  it  is  important  to 
emphasize  the  distinction  between  p  and  J. 

The  rate  of  change  of  angular  momentum,  obtained  by  differentiating 
(3.19)  is 

j  =  m^(r  A  f)  =  m(f  A  f  +  r  A  f).  (3.20) 

(Recall  that  the  usual  rule  for  differentiating  a  product  applies  also  to  vector 
products,  provided  that  the  order  of  the  two  factors  is  preserved.)  Now, 
the  first  term  in  (3.20)  is  zero,  because  it  is  the  vector  product  of  a  vector 
with  itself.  By  (1.1),  the  second  term  is  simply  r  A  F  =  G.  Thus  we  obtain 
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the  important  result  that  the  rate  of  change  of  the  angular  momentum  is 
equal  to  the  moment  of  the  applied  force: 

j  =  G.  (3.21) 

This  should  be  compared  with  the  equation  p  =  F  for  the  rate  of  change 
of  the  linear  momentum. 

Since  the  definition  of  the  vector  product  (see  Appendix,  §A.3)  depends 
on  the  choice  of  a  right-hand  screw  convention,  the  directions  of  the  vectors 
G  and  J  also  depend  on  this  convention.  A  vector  of  this  type  is  known 
as  an  axial  vector.  It  is  to  be  contrasted  with  an  ordinary,  or  polar,  vector. 
Axial  vectors  are  often  associated  with  rotation  about  an  axis.  What  is 
specified  physically  is  not  the  direction  along  the  axis,  but  the  sense  of 
rotation  about  it  (see  Fig.  3.1). 

3.4    Central  Forces;  Conservation  of  Angular  Momentum 

An  external  force  is  said  to  be  central  if  it  is  always  directed  towards  or 
away  from  a  fixed  point,  called  the  centre  of  force.  If  we  choose  the  origin 
to  be  this  centre,  this  means  that  F  is  always  parallel  to  the  position  vector 
r.  Since  the  vector  product  of  two  parallel  vectors  is  zero,  the  condition  for 
a  force  F  to  be  central  is  that  its  moment  about  the  centre  should  vanish: 

G  =  r  A  F  =  0.  (3.22) 

From  (3.21)  it  follows  that  if  the  force  is  central,  the  angular  momentum  is 
a  constant: 

J  =  constant.  (3.23) 

This  is  the  law  of  conservation  of  angular  'momentum  in  its  simplest  form. 
It  really  contains  two  statements:  that  the  direction  of  J  is  constant,  and 
that  its  magnitude  is  constant.  Let  us  look  at  these  in  turn. 

The  direction  of  J  is  that  of  the  normal  to  the  plane  of  r  and  v  =  r. 
Hence,  the  statement  that  this  direction  is  fixed  implies  that  r  and  v  must 
always  lie  in  a  fixed  plane.  In  other  words,  the  motion  of  the  particle  is 
confined  to  the  plane  containing  the  initial  position  vector  and  velocity 
vector  (see  Fig.  3.2).  This  is  obvious  physically;  for,  since  the  force  is 
central,  it  has  no  component  perpendicular  to  this  plane,  and,  since  the 
normal  component  of  velocity  is  initially  zero,  it  must  always  remain  zero. 
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Fig.  3.2 

To  understand  the  meaning  of  the  second  part  of  the  law,  the  constancy 
of  the  magnitude  J  of  J,  it  is  convenient  to  introduce  polar  co-ordinates 
r  and  9  in  the  plane  of  the  motion.  In  a  short  time  interval,  in  which  the 
co-ordinates  change  by  amounts  dr  and  d6>,  the  distances  travelled  in  the 
radial  and  transverse  directions  are  dr  and  rdd,  respectively  (see  Fig.  3.3). 
Thus  the  radial  and  transverse  components  of  the  velocity  are 


Fig.  3.3 


vr  =  r,       vg  =  r6.  (3.24) 

The  magnitude  of  the  angular  momentum  is  then 

J  =  mrvg  =  mr29.  (3.25) 

It  is  now  easy  to  find  a  geometrical  interpretation  of  the  statement  that  J 
is  a  constant.  We  note  that  when  the  angle  6  changes  by  an  amount  d8, 
the  radius  vector  sweeps  out  an  area 


d^  =  \r2&6. 
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Thus  the  rate  of  sweeping  out  area  is 

A  A  .  J 

—  =  lr29  =   =  constant.  (3.26) 

At      2  2m  v  ' 

One  of  the  most  important  applications  is  to  planetary  motion,  and  in 
that  context,  the  constancy  of  the  rate  of  sweeping  out  area  is  generally 
referred  to  as  Kepler's  second  law,  although  as  we  have  seen  it  applies 
more  generally  to  motion  under  any  central  force.  It  applies  even  if  the 
force  is  non-conservative  —  for  example,  to  a  particle  attached  to  the  end 
of  a  string  which  is  gradually  being  wound  in  towards  the  centre.  An 
equivalent  statement  of  the  law,  which  follows  at  once  from  (3.25),  is  that 
the  transverse  component  of  the  velocity,  vg ,  varies  inversely  with  the  radial 
distance  r. 

When  the  force  is  both  central  and  conservative,  the  two  conservation 
laws,  (3.5)  and  (3.23),  together  give  us  a  great  deal  of  information  about 
the  motion  of  the  particle.  This  important  special  case  is  the  subject  of  the 
next  chapter. 


3.5    Polar  Co-ordinates 

In  problems  with  particular  symmetries,  it  is  often  convenient  to  use  non- 
Cartesian  co-ordinates.  In  particular,  in  the  case  of  axial  or  spherical  sym- 
metry, we  may  use  cylindrical  polar  co-ordinates  p,tp,z,  or  spherical  polar 
co-ordinates  r,  6,  ip.  These  are  related  to  Cartesian  co-ordinates  by 


Fig.  3.4 
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x  =  p  cos  tp  =  rsin6*cos<^, 

y  =  p  sin  tp  =  r  sin  8  sin  ip,  (3.27) 
z  =  z         =  rcos9. 

(See  Fig.  3.4.)  The  relation  between  the  two  is  clearly 

z  =  r  cos  8 

p  =  rsin8,  (3.28) 

ip  =  ip. 

It  is  easy  to  find  the  components  of  velocity  in  the  various  co-ordinate  di- 
rections, analogous  to  (3.24).  In  the  cylindrical  polar  case,  the  co-ordinates 
p  and  ip  are  just  plane  polar  co-ordinates  in  the  xy-pl&ne,  and  we  obviously 
have 

vp  =  p,       vv=  pip       vz  =  z.  (3.29) 

In  the  spherical  polar  case,  a  curve  along  which  8  varies,  while  r  and  p  are 
fixed,  is  a  circle  of  radius  r  (corresponding  on  the  Earth  to  a  meridian  of 
longitude).  A  curve  along  which  only  ip  varies  is  a  circle  of  radius  p  =  r  sin  8 
(corresponding  to  a  parallel  of  latitude).  Thus  the  elements  of  length  in 
the  three  co-ordinate  directions  are  dr,  r  d8  and  rsm9dip,  and  we  get 

vr  =  r,        vg  =  rd,        vv  =  rsm8ip.  (3.30) 

For  use  later  in  this  chapter,  it  will  be  very  useful  to  have  expressions 
for  the  kinetic  energy  in  terms  of  polar  co-ordinates.  These  co-ordinates, 
though  curvilinear,  are  still  orthogonal  in  the  sense  that  the  three  co- 
ordinate directions  at  each  point  are  mutually  perpendicular.  Thus  we 
can  write,  for  example, 

2  2,2,2 
v    =vp  +  vv+vz- 

(This  would  not  be  true  if  the  co-ordinates  were  oblique.)  Hence  from  (3.29) 
and  (3.30)  we  find 

T=  \m{p2  +  p2p2  +  z2),  (3.31) 

and 


T  =  \m(r2  +  r292  +  r2  sin2  8  p2). 


(3.32) 


Energy  and  Angular  Momentum 


59 


Note  the  characteristic  difference  between  these  expressions  and  the  corre- 
sponding one  for  the  case  of  Cartesian  co-ordinates:  here  T  depends  not 
only  on  the  time  derivatives  of  the  co-ordinates,  but  also  on  the  co-ordinates 
themselves. 

We  could  now  go  on  and  find  expressions  for  the  acceleration  in  terms 
of  these  co-ordinates,  and  hence  write  down  equations  of  motion.  While  it 
is  not  particularly  hard  to  do  this  directly  in  these  simple  cases,  it  is  very 
convenient  to  have  a  general  prescription  for  writing  down  the  equations  of 
motion  in  arbitrary  co-ordinates.  A  method  of  doing  this  that  we  shall  use 
extensively  later  is  due  to  the  eighteenth-century  mathematician  Joseph- 
Louis  Lagrange.  It  can  be  done  as  soon  as  we  have  found  the  expression 
for  the  kinetic  energy,  like  (3.31)  or  (3.32). 

The  simplest  —  though  not  the  only  —  way  of  deriving  Lagrange's 
equations  is  to  use  what  is  known  as  a  variational  principle.  This  is  a 
principle  which  states  that  some  quantity  has  a  minimum  value,  or,  more 
generally,  a  stationary  value.  Principles  of  this  kind  are  used  in  many 
branches  of  physics,  particularly  in  quantum  mechanics  and  in  optics.  It 
may  therefore  be  useful,  before  discussing  the  particular  principle  we  shall 
require,  to  consider  the  general  kind  of  problem  in  which  such  methods  are 
used.  This  we  shall  do  in  the  next  section. 

3.6    The  Calculus  of  Variations 

Let  us  begin  with  what  may  seem  a  ridiculously  simple  example:  what  is 
the  shortest  path  between  two  points  in  a  plane?  Of  course  we  know  the 
answer  already,  but  the  method  we  shall  use  to  derive  it  can  also  be  applied 
to  less  trivial  examples  —  for  instance,  to  find  the  shortest  path  between 
two  points  on  a  curved  surface. 

Suppose  the  two  points  are  (xo,yo)  and  {xi,yi).  Any  curve  joining  them 
is  represented  by  an  equation  y  =  y(x)  such  that  the  function  satisfies  the 
boundary  conditions 

y(xo)  =  2/o,       y(xi)  =  2/i.  (3.33) 

Consider  two  neighbouring  points  on  this  curve.  The  distance  dl  between 
them  is  given  by 

dl  =  \J dx2  +  dy2  =  \f\  +  y'2  dx, 
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where  y'  =  dy /dx.  Thus  the  total  length  of  the  curve  is 

I  =  [  1  \A  +  V'2  dx.  (3.34) 

The  problem,  therefore,  is  to  find  that  function  y(x),  subject  to  the  bound- 
ary conditions  (3.33),  that  will  make  this  integral  a  minimum. 

This  problem  differs  from  the  more  familiar  kind  of  minimum-value 
problem  in  that  what  we  have  to  vary  is  not  just  a  single  variable  or  set 
of  variables,  but  a  function  y{x).  However,  we  can  still  apply  the  same 
criterion:  when  the  integral  has  a  minimum  value,  it  must  be  unchanged 
to  first  order  by  making  a  small  variation  in  the  function  y(x).  (We  shall 
not  be  concerned  with  the  problem  of  distinguishing  maxima  from  minima. 
All  we  shall  do  is  to  find  the  stationary  values.) 

More  generally,  we  may  be  interested  in  finding  the  stationary  values  of 
an  integral  of  the  form 

1=1  1  f(y,y')dx,  (3.35) 

J  Xo 

where  f(y,  y')  is  a  specified  function  of  y  and  its  first  derivative.  We  shall 
solve  this  general  problem,  and  then  apply  the  result  to  the  integral  (3.34). 

Consider  a  small  variation  5y(x)  in  the  function  y(x),  subject  to  the 
condition  that  the  values  of  y  at  the  end-points  are  unchanged  (see  Fig.  3.5): 


Fig.  3.5 


5y(x0)  =  0,       <$j/(xi)  =  0. 


(3.36) 
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To  first  order,  the  variation  in  f{y,y')  is 

where 

Thus  the  variation  of  the  integral  I  is 


Sy' 


d(Sy) 
da; 


5I=  r  (df5    |  Of  d(Sy)\x 
Jx0  \9y  V    dy'   dx  J  x' 


I  Xo 

In  the  second  term,  we  may  integrate  by  parts.  The  integrated  term,  namely 
vanishes  because  of  the  conditions  (3.36).  Hence  we  obtain 


SI 


1  Xo 

^dy 

— 

dx 

\dy>)_ 

Sy(x)  dx. 


(3.37) 


Now,  in  order  that  I  be  stationary,  this  variation  SI  must  vanish  for  an 
arbitrary  small  variation  Sy(x)  (subject  only  to  the  boundary  conditions 
(3.36)).  This  is  only  possible  if  the  integrand  vanishes  identically.  Thus  we 
require 


Of 


dy     dx  \dy' 


df 


0. 


(3.38) 


This  is  known  as  the  Euler-Lagrange  equation.  It  is  in  general  a  second- 
order  differential  equation  for  the  unknown  function  y(x),  whose  solution 
contains  two  arbitrary  constants  that  may  be  determined  from  the  bound- 
ary conditions. 

We  can  now  return  to  solving  the  problem  we  started  with,  of  finding 
the  curves  of  shortest  length  (called  geodesies)  in  a  plane,  pretending  for 
the  moment  that  we  don't  know  the  answer! 

Example:  Geodesies  on  a  plane 

What  is  the  shortest  path  between  two  given  points  in  a  plane? 


In  this  case,  comparing  (3.34)  with  (3.35),  we  have  to  choose  /  = 
+  y'2  and,  therefore, 
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Thus  the  Euler-Lagrange  equation  (3.38)  reads 


This  equation  states  that  the  expression  inside  the  parentheses  is  a  con- 
stant, and  hence  that  y'  is  a  constant.  Its  solutions  are  therefore  the 
straight  lines 

y  =  ax  +  b       (a,b  =  constants). 

Thus  we  have  proved  that  the  shortest  path  between  two  points  in  a 
plane  is  a  straight  line!  The  constants  a  and  b  are  of  course  fixed  by  the 
conditions  (3.33). 

So  far,  we  have  used  x  as  the  independent  variable,  but  in  the  applica- 
tions we  consider  later  we  shall  be  concerned  instead  with  functions  of  the 
time  t.  It  is  easy  to  generalize  the  discussion  to  the  case  of  a  function  /  of 
n  variables  q\,qi,  ■  ■  ■  ,qn  and  their  time  derivatives  q\,  c[2, . . . ,  q„.  In  order 
that  the  integral 

1  =  I  f{qi,---,qn,qi,---,qn)<it 
J  t0 

be  stationary,  it  must  be  unchanged  to  first  order  by  a  variation  in  any  one 
of  the  functions  qi(t)  (i  =  1,2, ...  ,n),  subject  to  the  conditions  Sqi(to)  = 
5qi{t\)  =  0.  Thus  we  require  the  n  Euler-Lagrange  equations 

|-s(I)=»'    <  =  '•»■■••.-  <3-> 

These  n  second-order  partial  differential  equations  determine  the  n  func- 
tions qi(t)  to  within  2n  arbitrary  constants  of  integration. 

3.7    Hamilton's  Principle;  Lagrange's  Equations 

We  shall  now  show  that  the  equations  of  motion  for  a  particle  moving 
under  a  conservative  force  can  be  written  as  the  Euler-Lagrange  equations 
corresponding  to  a  suitable  integral.  We  define  the  Lagrangian  function 


L  =  T-V  =  im(i2  +y2  +  z2)  -  V(x,  y,  z). 


(3.40) 
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(Note  the  minus  sign  —  this  is  not  the  total  energy.)  Its  derivatives  are 

9L        .  dL        dV  „ 

=  mx=pxt  =  _       =p  (3.4i) 

ox  ox  ox 

with  similar  equations  for  the  y  and  z  components.  Thus  the  equation  of 
motion  px  =  Fx  can  be  written  in  the  form 

(3  42) 

At\dx)~dx  {6Al) 

But  this  has  precisely  the  form  of  an  Euler-Lagrange  equation  for  the  in- 
tegral 

/=  [  *  Ldt.  (3.43) 

J  to 

This  is  generally  known  as  the  action  integral.  Strictly  speaking,  this  ter- 
minology does  not  accord  with  the  original,  in  which  the  term  'action'  was 
used  for  a  related  but  different  quantity  (the  integral  of  twice  the  kinetic 
energy),  which  is  relatively  little  used.  However,  it  is  more  convenient, 
and  has  now  become  standard  practice,  to  use  this  simple  name  for  the 
commonly  occurring  function  (3.43). 

Thus  we  have  arrived  at  Hamilton's  principle  (named  after  the 
nineteenth-century  Irish  mathematician  and  physicist  Sir  William  Rowan 
Hamilton):  the  action  integral  I  is  stationary  under  arbitrary  variations 
5x,  5y,  Sz  which  vanish  at  the  limits  of  integration  to  and  t±. 

The  importance  of  this  principle  lies  in  the  fact  that  it  can  immediately 
be  applied  to  any  set  of  co-ordinates.  If,  instead  of  using  Cartesian  co- 
ordinates x,y,z,  we  employ  a  set  of  curvilinear  co-ordinates  <7i,  <?2>  (?3>  then 
we  can  express  the  Lagrangian  function  L  =  T  —  V  in  terms  of  91,^2,93 
and  their  time  derivatives,  91  >  92)  93-  The  action  integral  must  then  be 
stationary  with  respect  to  arbitrary  variations  5q\,5q2,8qz,  subject  to  the 
conditions  that  the  variations  are  zero  at  the  limits  to  and  t\.  Thus  we 
obtain  Lagrange's  equations: 

d  / 8L\      dL  „  o  ,„  ^ 

dH%)  =  %'  i=w-  (3-44) 


These  are  the  equations  of  motion  in  terms  of  the  co-ordinates  gi,  92,  93- 
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By  analogy  with  (3.41),  we  may  define  the  generalized  momenta  and 
generalized  forces, 


dL  n  dL 

p2  =  —       and       Fi  =  —,  3.45 


so  that  the  equations  (3.44)  read 

Pi  =  Fi.  (3.46) 

It  must  be  emphasized,  however,  that  in  general  the  pt  and  Ft  are  not 
components  of  the  momentum  vector  p  and  the  force  vector  F  —  we  shall 
see  this  explicitly  in  the  examples  below. 

To  see  how  this  works  in  a  simple  case,  let  us  return  to  the  simple 
pendulum  discussed  in  §2.1. 

Example:  Simple  pendulum  revisited 

Find  the  equation  of  motion  for  a  simple  pendulum. 
Using  the  potential  energy  function  (2.9),  we  find 
L  =  T-V  =  \ml292  -  mgl{\  -  cos  (9). 

Thus 

dL        120         dL  1  ■  a 

— r  =  ml  a,        -p-r  =  —mqlsmv, 

and  Lagrange's  equation  yields 

ml  Q  =  —mglsmO. 

Note  that  to  find  this  equation  of  motion,  we  have  not  needed  expressions 
for  the  polar  components  of  acceleration,  as  we  would  to  derive  it  directly 
from  Newton's  laws;  nor  have  we  needed  to  introduce  the  tension  in  the 
rod. 


We  can  also  apply  these  equations  to  the  polar  coordinates  discussed 
in  §3.5.  In  the  cylindrical  polar  case,  the  derivatives  of  the  kinetic  energy 
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function  (3.31)  are 

dT  dT         ,  dT 

-Q-  =  mP>  d^=mP^  -dI  =  mZ> 

dT  .2  „  <9T  „ 

Hence  Lagrange's  equations  (3.44)  take  the  form 

^(V*)  =  -^,  (3-47) 
d  W 

The  generalized  momenta  rap  and  mi  corresponding  to  the  co-ordinates 
p  and  z  are  the  components  of  the  momentum  vector  p  in  the  p  and  z 
directions.  However,  the  generalized  momentum  mp2p>  is  an  angular  mo- 
mentum; in  fact,  the  angular  momentum  Jz  about  the  z-axis.  It  is  easy 
to  see  that,  whenever  q{  is  an  angle,  the  corresponding  generalized  mo- 
mentum pi  has  the  dimensions  of  angular,  rather  than  linear,  momentum. 
Note  that  when  the  potential  energy  is  independent  of  p,  then  the  second 
of  Eqs.  (3.47)  expresses  the  fact  that  the  component  Jz  of  angular  momen- 
tum is  conserved.  One  must  be  careful,  when  using  the  notation  pv  to 
specify  whether  this  is  the  ip  component  of  the  momentum  vector  p  or  the 
<p  generalized  momentum,  Jz. 

Proceeding  in  a  similar  way,  we  can  write  down  the  equations  of  motion 
in  spherical  polars,  using  the  derivatives  of  the  kinetic  energy  function 
(3.32).  We  find 

—  (mr)  =  mr(92  +  sin2  9  p>2)  -  — , 
at  or 

A (mr2e)  =  mr2  sin  9  cos  6  p2  -  ,  (3.48) 
dt  du 

d  /      2   •   2  n  -\  9V 

—  (mr  sin  Up)  = 

dt v  y'  dp 

Once  again,  the  generalized  momentum  mr2  sin2  8  p  is  the  component  of 
angular  momentum  Jz  about  the  2-axis,  and  is  a  constant  if  V  is  indepen- 
dent of  p.  The  generalized  momentum  mr29  also  has  the  dimensions  of 
angular  momentum.  Since  it  is  equal  to  the  radial  distance  r  multiplied 
by  the  9  component  of  momentum,  mr9,  it  is  in  fact  the  p  component  of 
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angular  momentum,  Jv.  Note,  however,  that  even  when  V  is  independent 
of  both  angular  variables  9  and  ip,  as  is  the  case  for  a  central  force,  Jv  is 
not  in  general  a  constant.  This  is  because  the  <p  direction  is  not  a  fixed 
direction  in  space,  like  the  z-axis,  but  changes  according  to  the  position  of 
the  particle.  Thus,  even  when  J  is  a  fixed  vector,  the  component  Jv  may 
vary. 

Before  concluding  this  section,  it  will  be  useful  to  discuss  briefly  the 
special  case  of  a  central  conservative  force,  which  is  treated  in  detail  in  the 
following  chapter.  In  spherical  polars,  the  components  of  the  force  F  (not 
the  generalized  forces)  are  given  in  terms  of  the  potential  by 

Fr  =  _^;       Fe  =  -1-^,       F  =  l—  ™. 

dr  '  r  89'  rsmQ  dip 

(See  (A. 50).)  Thus  the  force  will  be  purely  radial  if,  and  only  if,  V  is  inde- 
pendent of  both  angular  variables  9  and  ip.  The  potential  energy  function 
(3.9)  of  the  three-dimensional  harmonic  oscillator  is  an  example.  Then  we 
have 


(3.49) 


This  shows  incidentally  that  the  magnitude  of  a  central,  conservative  force 
depends  only  on  r,  not  on  9  or  ip. 

We  know  from  the  discussion  of  §3.4  that  the  angular  momentum  vector 
J  in  this  case  is  a  constant,  and  that  the  motion  is  confined  to  a  plane.  The 
equations  of  motion  are  considerably  simplified  by  an  appropriate  choice  of 
the  polar  axis  9  =  0.  If  it  is  chosen  so  that  ip  =  0  initially,  then  by  the 
third  of  Eqs.  (3.48),  ip  will  always  be  zero.  The  motion  is  then  in  a  plane 
ip  =  constant,  through  the  polar  axis,  in  which  r  and  9  play  the  role  of 
plane  polar  co-ordinates.  The  angular  momentum  J  =  Jv  =  mr29  is  then 
a  constant,  by  the  second  of  Eqs.  (3.48). 

Similarly,  if  the  particle  is  originally  moving  in  the  'equatorial'  plane, 
#  =  §  with  9  =  0,  then  from  the  second  equation  it  follows  that  9  is  always 
^.  In  this  case,  the  plane  polar  co-ordinates  are  r  and  ip,  and  the  angular 
momentum  is  J  =  Jz  =  rnr2ip. 

3.8  Summary 


The  rate  of  change  of  kinetic  energy  of  a  particle  is  the  rate  at  which  the 
force  does  work,  T  =  r  ■  F.  The  force  is  conservative  if  it  has  the  form 
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F  =  — W(j"),  and  in  that  case  the  total  energy  T  +  V  is  a  constant.  The 
condition  for  the  existence  of  such  a  function  is  that  F  should  be  a  function 
of  position  such  that  V  A  F  =  0. 

The  rate  of  change  of  angular  momentum  of  a  particle  is  equal  to  the 
moment  of  the  force,  J  =  r  A  F.  When  the  force  is  central,  the  angular 
momentum  is  conserved.  Then  the  motion  is  confined  to  a  plane,  and  the 
rate  of  sweeping  out  area  in  this  plane  is  a  constant. 

The  use  of  these  conservation  laws  greatly  simplifies  the  treatment  of 
any  problem  involving  central  or  conservative  forces.  When  the  force  is 
both  central  and  conservative,  they  provide  all  the  information  we  need 
to  determine  the  motion  of  the  particle,  as  we  shall  see  in  the  following 
chapter. 

Lagrange's  equations  are  of  great  importance  in  advanced  treatments  of 
classical  mechanics  (and  also  in  quantum  mechanics).  We  have  seen  that 
they  can  be  used  to  write  down  equations  of  motion  in  any  system  of  co- 
ordinates, as  soon  as  we  have  found  the  expressions  for  the  kinetic  energy 
and  potential  energy.  In  later  chapters,  we  shall  see  that  the  method  can 
readily  and  usefully  be  extended  to  more  complicated  systems  than  a  single 
particle. 


Problems 

1.  Find  which  of  the  following  forces  are  conservative,  and  for  those  that 
are  find  the  corresponding  potential  energy  function  (a  and  b  are  con- 
stants, and  a  is  a  constant  vector): 

(a)  Fx=ax  +  by2,     Fy  =  az  +  2bxy,     Fz  =  ay  +  bz2; 

(b)  Fx  =  ay,     Fy  =  az,     Fz  =  ax; 

(c)  Fr  =  2ar  sin  9  sin  tp,     Fg  =  ar  cos  8  sin  ip,     Fv  =  ar  cos  ip; 

(d)  F  =  aAr; 

(e)  F  =  ra; 

(f)  F  =  a(a-r). 

2.  Given  that  the  force  is  as  in  Problem  1(a),  evaluate  the  work  done  in 
taking  a  particle  from  the  origin  to  the  point  (1, 1,0):  (i)  by  moving 
first  along  the  a;-axis  and  then  parallel  to  the  y-axis,  and  (ii)  by  going 
in  a  straight  line.  Verify  that  the  result  in  each  case  is  equal  to  minus 
the  change  in  the  potential  energy  function. 

3.  Repeat  the  calculations  of  Problem  2  for  the  force  in  1(b). 

4.  Compute  the  work  done  in  taking  a  particle  around  the  circle  x2  +y2  = 
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a2,z  =  0  if  the  force  is  (a)  F  =  yi,  and  (b)  F  =  xi.  What  do  you 
conclude  about  these  forces?  (Use  the  parametrization  x  =  a  cos  ip,y  = 
a  sin  ip,  z  =  0.) 

5.  Evaluate  the  force  corresponding  to  the  potential  energy  function 
V(r)  =  cz/r3,  where  c  is  a  constant.  Write  your  answer  in  vector  nota- 
tion, and  also  in  spherical  polars,  and  verify  that  it  satisfies  VAF  =  0. 

6.  A  projectile  is  launched  with  velocity  100ms-1  at  60°  to  the  horizontal. 
Atmospheric  drag  is  negligible.  Find  the  maximum  height  attained  and 
the  range.  What  other  angle  of  launch  would  give  the  same  range?  Find 
the  time  of  flight  in  each  of  the  two  cases. 

7.  *Find  the  equation  for  the  trajectory  of  a  projectile  launched  with  ve- 
locity v  at  an  angle  a  to  the  horizontal,  assuming  negligible  atmospheric 
resistance.  Given  that  the  ground  slopes  at  an  angle  (3,  show  that  the 
range  of  the  projectile  (measured  horizontally)  is 

2v2  sin(a  —  (3)  cos  a 

x  =  a  • 

g  cos  p 

At  what  angle  should  the  projectile  be  launched  to  achieve  the  maxi- 
mum range? 

8.  *By  expanding  the  logarithm  in  (3.17),  find  the  approximate  equation 
for  the  trajectory  of  a  projectile  subject  to  small  atmospheric  drag  to 
first  order  in  7.  (Note  that  this  requires  terms  up  to  order  73  in  the 
logarithm.)  Show  that  to  this  order  the  range  (on  level  ground)  is 

_  2uw  8juw2 
9  3g2 

and  hence  that  to  maximize  the  range  for  given  launch  speed  v  the  angle 
of  launch  should  be  chosen  to  satisfy  cos  2a  =  ^/2~fv/3g.  (Hint:  In  the 
term  containing  7,  you  may  use  the  zeroth-order  approximation  for  the 
angle.)  For  a  projectile  whose  terminal  speed  if  dropped  from  rest  (see 
Chapter  2,  Problem  13)  would  be  500ms_1,  estimate  the  optimal  angle 
and  the  range  if  the  launch  speed  is  100  ms-1. 

9.  *Show  that  in  the  limit  of  strong  damping  (large  7)  the  time  of  flight 
of  a  projectile  (on  level  ground)  is  approximately  t  fa  (w/g  +  1/7)(1  — 
e-i--fw/gy  ghow  tnat  to  the  same  order  of  accuracy  the  range  is  x  « 
(u/7)(l  —  e~x~lwl9).  For  a  projectile  launched  at  800 ms_1  with  7  = 

0.  1  s_1,  estimate  the  range  for  launch  angles  of  30°,  20°  and  10°. 

10.  A  particle  of  mass  m  is  attached  to  the  end  of  a  light  string  of  length 

1.  The  other  end  of  the  string  is  passed  through  a  small  hole  and  is 
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slowly  pulled  through  it.  Gravity  is  negligible.  The  particle  is  originally 
spinning  round  the  hole  with  angular  velocity  w.  Find  the  angular 
velocity  when  the  string  length  has  been  reduced  to  \l.  Find  also  the 
tension  in  the  string  when  its  length  is  r,  and  verify  that  the  increase  in 
kinetic  energy  is  equal  to  the  work  done  by  the  force  pulling  the  string 
through  the  hole. 

11.  A  particle  of  mass  m  is  attached  to  the  end  of  a  light  spring  of  equi- 
librium length  a,  whose  other  end  is  fixed,  so  that  the  spring  is  free  to 
rotate  in  a  horizontal  plane.  The  tension  in  the  spring  is  k  times  its 
extension.  Initially  the  system  is  at  rest  and  the  particle  is  given  an 
impulse  that  starts  it  moving  at  right  angles  to  the  spring  with  velocity 
v.  Write  down  the  equations  of  motion  in  polar  co-ordinates.  Given 
that  the  maximum  radial  distance  attained  is  2a,  use  the  energy  and 
angular  momentum  conservation  laws  to  determine  the  velocity  at  that 
point,  and  to  find  v  in  terms  of  the  various  parameters  of  the  system. 
Find  also  the  values  of  r  when  r  =  a  and  when  r  =  2a. 

12.  *A  light  rigid  cylinder  of  radius  2a  is  able  to  rotate  freely  about  its 
axis,  which  is  horizontal.  A  particle  of  mass  m  is  fixed  to  the  cylinder 
at  a  distance  a  from  the  axis  and  is  initially  at  rest  at  its  lowest  point. 
A  light  string  is  wound  on  the  cylinder,  and  a  steady  tension  F  applied 
to  it.  Find  the  angular  acceleration  and  angular  velocity  of  the  cylinder 
after  it  has  turned  through  an  angle  9.  Show  that  there  is  a  limiting 
tension  Fq  such  that  if  F  <  Fq  the  motion  is  oscillatory,  but  if  F  > 
Fo  it  continues  to  accelerate.  Estimate  the  value  of  Fq  by  numerical 
approximation. 

13.  In  the  system  of  Problem  12,  instead  of  a  fixed  tension  applied  to  the 
string,  a  weight  of  mass  m/2  is  hung  on  it.  Use  the  energy  conservation 
equation  to  find  the  angular  velocity  as  a  function  of  9.  Find  also  the 
angular  acceleration  and  the  tension  in  the  string.  (Compare  your 
results  with  those  in  Problem  12.)  Show  that  there  is  a  point  at  which 
the  tension  falls  to  zero,  and  find  the  angle  at  which  this  occurs.  What 
happens  immediately  beyond  this  point? 

14.  A  wedge-shaped  block  of  mass  M  rests  on  a  smooth  horizontal  table.  A 
small  block  of  mass  m  is  placed  on  its  upper  face,  which  is  also  smooth 
and  inclined  at  an  angle  a  to  the  horizontal.  The  system  is  released 
from  rest.  Write  down  the  horizontal  component  of  momentum,  and 
the  kinetic  energy  of  the  system,  in  terms  of  the  velocity  v  of  the  wedge 
and  the  velocity  u  of  the  small  block  relative  to  it.  Using  conservation 
of  momentum  and  the  equation  for  the  rate  of  change  of  kinetic  energy, 
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find  the  accelerations  of  the  blocks.  Given  that  M  =  1  kg  and  m  = 
250  g,  find  the  angle  a  that  will  maximize  the  acceleration  of  the  wedge. 
15.  *A  particle  starts  from  rest  and  slides  down  a  smooth  curve  under 
gravity.  Find  the  shape  of  the  curve  that  will  minimize  the  time  taken 
between  two  given  points.  [Take  the  origin  as  the  starting  point  and 
the  z  axis  downwards.  Show  that  the  time  taken  is 


where  c  is  an  integration  constant.  To  complete  the  integration,  use 
the  substitution  z  =  c~2  sin2  6.  This  famous  curve  is  known  as  the 
brachistochrone.  It  is  in  fact  an  example  of  a  cycloid,  the  locus  of  a 
point  on  the  rim  of  a  circle  of  radius  l/2c2  being  rolled  beneath  the 
a;- axis. 

This  problem  was  first  posed  by  Johann  Bernoulli  on  New  Year's  Day 
1697  as  an  open  challenge.  Newton's  brilliant  solution  method  initiated 
the  calculus  of  variations.  Bernoulli  had  an  equally  brilliant  idea,  using 
an  optical  analogy  with  refraction  of  a  light  ray  through  a  sequence  of 
plates  and  Fermat's  principle  of  least  time.] 
16.  *The  position  on  the  surface  of  a  cone  of  semi-vertical  angle  a  is  spec- 
ified by  the  distance  r  from  the  vertex  and  the  azimuth  angle  ip  about 
the  axis.  Show  that  the  shortest  path  (or  geodesic)  along  the  surface 
between  two  given  points  is  specified  by  a  function  r(tp)  obeying  the 
equation 


Show  that  the  solution  is  r  =  rosec[(cp  —  tpo)sma],  where  ro  and  ipo 
are  constants.  [The  equation  may  be  solved  by  the  standard  technique 
of  introducing  a  new  dependent  variable  u  =  dr/dtp,  and  writing 


The  substitution  u2  =  v  then  reduces  the  equation  to  a  linear  form  that 
may  be  solved  by  using  an  integrating  factor  (see  Chapter  2,  Problem 


and  hence  that  for  a  minimum 


d2r  du  dr  du  du 
dip2      dtp      dp  dr  dr 
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13).  Finally,  to  calculate  cp  =  J(l/u)dr,  one  may  use  the  substitution 
r  =  1/x.] 

17.  *Find  the  geodesies  on  a  sphere  of  unit  radius.  [Hint:  Use  9  as  in- 
dependent variable,  and  look  for  the  path  tp  =  tp(6).  To  perform  the 
integration,  use  the  substitution  x  =  cot  9.] 

18.  * Parabolic  co-ordinates  (f,  rf)  in  a  plane  are  defined  by  £  =  r  +  x,n  = 
r  —  x.  Find  x  and  y  in  terms  of  £  and  rj.  Show  that  the  kinetic  energy 
of  a  particle  of  mass  m  is 


Hence  find  the  equations  of  motion. 

19.  Write  down  the  equations  of  motion  in  polar  co-ordinates  for  a  particle 
of  unit  mass  moving  in  a  plane  under  a  force  with  potential  energy 
function  V  =  —klnr  +  cr  +  gr  cos  0,  where  k,c  and  g  are  positive 
constants.  Find  the  positions  of  equilibrium  (a)  if  c  >  g,  and  (b) 
if  c  <  g.  By  considering  the  equations  of  motion  near  these  points, 
determine  whether  the  equilibrium  is  stable  (i.e.,  will  the  particle,  if 
given  a  small  displacement,  tend  to  return  repeatedly?). 

20.  If  91,52,93  are  orthogonal  curvilinear  co-ordinates,  and  the  element  of 
length  in  the  qi  direction  is  hidqt,  write  down  (a)  the  kinetic  energy  T 
in  terms  of  the  generalized  velocities  qi,  (b)  the  generalized  momentum 
Pi  and  (c)  the  component  e4  •  p  of  the  momentum  vector  p  in  the  qi 
direction.  (Here  e,  is  a  unit  vector  in  the  direction  of  increasing  qi.) 

21.  *By  comparing  the  Euler-Lagrange  equations  with  the  corresponding 
components  of  the  equation  of  motion  mr  =  -VV,  show  that  the 
component  of  the  acceleration  vector  in  the  qi  direction  is 


Use  this  result  to  identify  the  components  of  the  acceleration  in  cylin- 
drical and  spherical  polars. 

22.  For  the  case  of  plane  polar  co-ordinates  r,  6,  write  the  unit  vectors  er 
(=  f)  and  eg  in  terms  of  i  and  j.  Hence  show  that  der/d9  =  eg  and 
deg/d9  =  —  er.  By  starting  with  r  =  rer  and  differentiating,  rederive 
the  expressions  for  the  components  of  the  velocity  and  acceleration 
vectors. 

23.  *Find  the  corresponding  formulae  for  dei/dqj  for  spherical  polar  co- 
ordinates, and  hence  verify  the  results  obtained  in  Problem  21. 
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24.  *The  motion  of  a  particle  in  a  plane  may  be  described  in  terms  of 
elliptic  co-ordinates  A,  8  defined  by 

x  =  c  cosh  A  cos  0,        y  =  csinh  Asin#,        (A  >  0,  0  <  9  <  2-k), 

where  c  is  a  positive  constant.  Show  that  the  kinetic  energy  function 
may  be  written 

T  =  ±mc2(cosh2  A  -  cos2  6>)(A2  +  92). 

Hence  write  down  the  equations  of  motion. 

25.  *The  method  of  Lagrange  multipliers  (see  Appendix  A,  Problem  11) 
can  be  extended  to  the  calculus  of  variations.  To  find  the  maxima  and 
minima  of  an  integral  /,  subject  to  the  condition  that  another  integral 
J  =  0,  we  have  to  find  the  stationary  points  of  the  integral  /  —  A  J 
under  variations  of  the  function  y(x)  and  of  the  parameter  A.  Apply 
this  method  to  find  the  catenary,  the  shape  in  which  a  uniform  heavy 
chain  hangs  between  two  fixed  supports.  [The  required  shape  is  the  one 
that  minimizes  the  total  potential  energy,  subject  to  the  condition  that 
the  total  length  is  fixed.  Show  that  this  leads  to  a  variational  problem 
with 

f(y,yi)  =  (y-\)^TTy^, 

and  hence  to  the  equation 

(y-\)y"  =  l  +  y'\ 

Solve  this  equation  by  introducing  the  new  variable  u  =  y'  and  solving 
for  u(y)] 

26.  *A  curve  of  given  total  length  is  drawn  in  a  plane  between  the  points 
(±a,0).  Using  the  method  of  Problem  25,  find  the  shape  that  will 
enclose  the  largest  possible  area  between  the  curve  and  the  a;-axis. 


Chapter  4 

Central  Conservative  Forces 


The  most  important  forces  of  this  type  are  the  gravitational  and  electro- 
static forces,  which  obey  the  'inverse  square  law'.  Much  of  this  chapter  will 
therefore  be  devoted  to  this  special  case.  We  begin,  however,  by  discussing 
a  problem  for  which  it  is  particularly  easy  to  solve  the  equations  of  motion 
—  the  three-dimensional  analogue  of  the  harmonic  oscillator  discussed  in 
§2.2. 

4.1    The  Isotropic  Harmonic  Oscillator 

We  consider  a  particle  moving  under  the  action  of  a  central  restoring  force 
proportional  to  its  distance  from  the  origin, 

F  =  -kr,  (4.1) 

with  k  constant.  The  corresponding  equation  of  motion  is 

mr  +  kr  =  0, 

or,  in  terms  of  components, 

mx  +  kx  =  0,        my  +  ky  =  0,        m'z  +  kz  =  0. 

The  equation  of  motion  for  each  co-ordinate  is  thus  identical  with  the 
equation  (2.13)  for  the  simple  harmonic  oscillator.  This  oscillator  is  called 
isotropic  because  all  directions  (not  only  those  along  the  axes!)  are  equiv- 
alent. The  anisotropic  oscillator  (which  we  shall  not  discuss)  is  described 
by  similar  equations,  but  with  different  constants  in  the  three  equations. 
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The  general  solution  is  again  given  by  (2.19): 

x  =  cx  cos  cut  +  dx  sinwi, 
y  =  cy  cos  wt  +  dy  sin  cot, 
z  =  cz  cos  uit  +  dz  sin  uit , 

where  ui  =  fc/m.  In  vector  notation,  the  solution  is 

r  =  c  cos  uit  +  d  sin  uit.  (4-2) 

Clearly,  the  motion  is  periodic,  with  period  r  =  2-it/ui.  (Other  forms  of  the 
solution  can  be  given,  analogous  to  (2.20)  or  (2.23).) 

As  in  the  one-dimensional  case,  the  arbitrary  constant  vectors  c  and  d 
can  be  fixed  by  the  initial  conditions.  If  at  t  =  0  the  particle  is  at  ro  and 
moving  with  velocity  i>o,  then 

c  =  r0,       d  =  v0/ui.  (4.3) 

We  can  easily  verify  the  conservation  laws  for  angular  momentum  and 
energy.  From  (4.2)  it  is  clear  that  r  always  lies  in  the  plane  of  c  and  d,  so 
that  the  direction  of  J  is  fixed.  Using 

r  =  —ujcsaiuit  +  uidcosuit, 

we  find  explicitly 

J  =  rnr  Af  =  rnuic  Ad  =  mr0  A  vq,  (4.4) 

which  is  obviously  a  constant. 

The  potential  energy  function  corresponding  to  the  force  (4.1)  —  or 
(3.10)  —  is  just  (3.9),  namely 

V=  \kr2  =  \k(x2 +y2  +  z2).  (4.5) 

Thus,  evaluating  the  energy,  we  find 

E  =  \mr2  +  \kr2  =  \k{c2  +  d2)  =  \mv2Q  +  \kr2Q,  (4.6) 

which  is  again  a  constant. 

To  find  the  shape  of  the  particle  orbit,  it  is  convenient  to  rewrite  (4.2) 
in  a  slightly  different,  but  equivalent,  form.  If  6  is  any  fixed  angle,  we  can 
write 

r  =  acos(u)t- 6) +  bsin(ujt- 0),  (4.7) 
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where 


or,  equivalently, 


c  =  acos9  —  6sin#, 
d  =  as'md  +  bcosO. 


(4.8) 


a  =  c  cos  9  +  dsmO, 
b  =  —csinO  +  dcos9. 

We  can  now  choose  9  so  that  a  and  b  are  perpendicular.  This  requires 

0  =  ab=  -(c2  -d2)sin6lcos6'  +  c-d(cos26'-sin2  6), 

or 

tan  26* 


2c  -  d 


!-d2 

We  are  now  free  to  choose  our  axes  so  that  the  X-axis  is  in  the  direction  of 
a  and  the  j/-axis  in  the  direction  of  b.  Equation  (4.7)  then  becomes 

x  =  acos(ujt  —  6),        y  =  bsm(cut  —  9),        2  =  0. 

The  equation  of  the  orbit  is  obtained  by  eliminating  the  time  from  these 
equations.  It  is 


+ 
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0. 


(4.9) 


This  is  the  well-known  equation  of  an  ellipse  with  centre  at  the  origin,  and 
semi-axes  a  and  b.  (See  Appendix  B,  Eq.  (B.2),  and  Fig.  4.1.)  The  vectors  c 
and  d  are  what  is  known  in  geometry  as  a  pair  of  conjugate  semi-diameters 
of  the  ellipse. 


Fig.  4.1 
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The  simplest  way  to  determine  the  magnitudes  of  the  semi-axes  a  and 
b  from  the  initial  conditions  (that  is,  from  c  and  d)  is  to  use  the  constancy 
of  E  and  J.  Equating  the  expressions  for  these  quantities,  (4.6)  and  (4.4), 
to  the  corresponding  expressions  in  terms  of  a  and  b,  we  obtain 

a  A  b  =  c  A  d. 

(These  equations  may  also  be  obtained  directly  from  (4.8).)  If  a  is  the 
angle  between  c  and  d,  the  second  equation  yields  ab  =  cdsina.  Hence 
eliminating  b  (or  a)  between  these  two  equations,  we  obtain  for  a2  (or  b2) 
the  quadratic  equation 

a4  -  (c2  +  d2)a2  +  c2d2  sin2  a  =  0.  (4.10) 

Conventionally,  a  is  taken  to  be  the  larger  root,  and  called  the  semi-major 
axis. 


4.2    The  Conservation  Laws 


We  now  consider  the  general  case  of  a  central,  conservative  force.  It  corre- 
sponds, as  we  saw  in  §3.7,  to  a  potential  energy  function  V(r)  depending 
only  on  the  radial  co-ordinate  r.  There  are  two  conservation  laws,  one  for 
energy, 

\mr2  +  V(r)  =  E  =  constant, 
and  one  for  angular  momentum 

mr  A  r  =  J  =  constant. 


According  to  the  discussion  of  §3.4,  the  second  of  these  laws  implies  that 
the  motion  is  confined  to  a  plane,  so  that  the  problem  is  effectively  two- 
dimensional.  Introducing  polar  co-ordinates  r,  8  in  this  plane,  we  may  write 
the  two  conservation  laws  in  the  form 

im(r2  +  r262)  +  V(r)=E, 

mr29  =  J.  (  ' 


A  great  deal  of  information  about  the  motion  can  be  obtained  directly 
from  these  equations,  without  actually  having  to  solve  them  to  find  r  and 
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9  as  functions  of  the  time.  We  note  that  9  may  be  eliminated  to  yield  an 
equation  involving  only  r  and  r: 

We  shall  call  this  the  radial  energy  equation.  For  a  given  value  of  J,  it  has 
precisely  the  same  form  as  the  one-dimensional  energy  equation  with  an 
effective  potential  energy  function 

It  is  easy  to  understand  the  physical  significance  of  the  extra  term  J2 /2mr2 
in  this  'potential  energy'  function.  It  corresponds  to  a  'force'  J2 /mr3 .  This 
is  precisely  the  'centrifugal  force'  rnr92  (the  first  term  on  the  right  hand 
side  of  the  radial  equation  in  (3.48)),  expressed  in  terms  of  the  constant  J 
rather  than  the  variable  9. 

We  can  use  the  radial  energy  equation  just  as  we  did  in  §2.1.  Since  r2 
is  positive,  the  motion  is  limited  to  the  range  of  values  of  r  for  which 

U(r)  =  ^  +  V(r)<E.  (4.14) 

The  maximum  and  minimum  radial  distances  are  given  by  the  values  of  r 
for  which  the  equality  holds. 


Fig.  4.2 

As  an  example,  let  us  consider  again  the  case  of  the  isotropic  harmonic 
oscillator,  for  which  V(r)  =  ^kr2.  The  corresponding  effective  potential 
energy  function  U(r)  is  shown  in  Fig.  4.2.  It  has  a  minimum  (corresponding 
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to  a  position  of  stable  equilibrium  in  the  one-dimensional  case)  at 


(4.15) 


When  the  value  of  E  is  equal  to  the  minimum  value  oiU,r  must  always  be 
zero,  and  r  is  fixed  at  the  minimum  position.  In  this  case,  the  particle  must 
move  in  a  circle  around  the  origin.  It  is  interesting  to  note  that  we  could 
also  obtain  (4.15)  by  equating  the  attractive  force  kr  to  the  centrifugal  force 
in  the  circular  orbit,  J2/mr3. 

For  any  larger  value  of  E,  the  motion  is  confined  to  the  region  b  <  r  <  a 
between  the  two  limiting  values  of  r,  given  by  the  solutions  of  the  equality 
in  (4.14).  If  the  particle  is  initially  at  a  distance  ro  from  the  origin,  and 
moving  with  velocity  vo  in  a  direction  making  an  angle  a  with  the  radius 
vector  (as  in  Fig.  4.1),  then  the  values  of  E  and  J  are 

E  =  |m«Q  +  |fcrQ,        J  =  mrovo  sin  a. 

Tims  the  equation  whose  roots  are  a  and  b  becomes  (on  multiplying  by 
2r2/k) 

r4  ~  (rg  +  jv^j  r2  +  jr2v2  sin2  a  =  0. 
By  (4.3),  this  is  identical  with  the  equation  (4.10)  found  previously. 


4.3    The  Inverse  Square  Law 

We  now  consider  a  force 

F  -  —  r 

where  k  is  a  constant.    The  corresponding  potential  energy  function  is 
V(r)  =  k/r.  The  constant  k  may  be  either  positive  or  negative;  in  the  first 
case,  the  force  is  repulsive,  and  in  the  second,  attractive. 
The  radial  energy  equation  for  this  case  is 

\mf2  +  +  -=E.  (4.16) 

It  corresponds  to  the  'effective  potential  energy'  function 


U(  )  -\-  ^ 

2mr2  r 
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Repulsive  case 

We  suppose  first  that  k  >  0.  Then  U (r)  decreases  monotonically  from  +00 
at  r  =  0  to  0  at  r  =  00  (see  Fig.  4.3).  Thus  it  has  no  minima,  and  circular 


U 


Fig.  4.3 

motion  is  impossible,  as  is  physically  obvious.  For  any  positive  value  of  E, 
there  is  a  minimum  value  of  r,  n  say,  which  is  the  unique  positive  root  of 
U(r)  =  E,  but  no  maximum  value.  If  the  radial  velocity  is  initially  inward, 
the  particle  must  follow  an  orbit  on  which  r  decreases  to  r\  (at  which  point 
the  velocity  is  purely  transverse),  and  then  increases  again  without  limit. 
As  is  well  known,  and  as  we  shall  show  in  the  next  section,  the  orbit  is 
actually  a  hyperbola. 

Let  us  consider  an  example. 

Example:  Distance  of  closest  approach 

A  charged  particle  of  charge  q  moves  in  the  field  of  a  fixed  point 
charge  q' ,  with  qq'  >  0.  Initially  it  is  approaching  the  centre  of 
force  with  velocity  v  (at  a  large  distance)  along  a  path  which,  if 
continued  in  a  straight  line,  would  pass  the  centre  at  a  distance 
b.  Find  the  distance  of  closest  approach. 

This  distance  b  is  known  as  the  impact  parameter,  and  will  appear 
frequently  in  our  future  work.  (See  Fig.  4.4.)  Since  the  particle  starts 
from  a  great  distance,  its  initial  potential  energy  is  negligible.  (Note  that 
we  have  chosen  the  arbitrary  additive  constant  in  V  so  that  V(oo)  =  0.) 
Thus  the  total  energy  is  simply 


E  =  \mv2. 


(4.17) 
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Fig.  4.4 


Moreover,  since  the  component  of  r  perpendicular  to  v  is  b,  the  angular 
momentum  is 

J  =  mvb.  (4.18) 

The  distance  of  closest  approach,  r±,  is  obtained  by  substituting  these 
values,  and  k  =  qq'f-iiTeQ,  into  the  radial  energy  equation,  and  then 
setting  r  =  0.  This  yields 

r\  -  2an  -  b2  =  0.        where       a  =  — — — 7.  (4.19) 

The  required  solution  is  the  positive  root 

n=a+  \/a2  +  b2.  (4.20) 


Attractive  case 

We  now  suppose  that  k  <  0,  as  in  the  gravitational  case.  It  will  be  useful 
to  define  a  quantity  I,  with  the  dimensions  of  length,  by 

Then  the  effective  potential  energy  function  is 
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Fig.  4.5 

Here,  different  types  of  motion  are  possible  according  to  the  value  of  E. 
We  may  distinguish  four  cases: 

(.» 

This  is  the  minimum  value  of  U.  Hence,  r  must  always  be  zero,  and 
the  particle  must  move  in  a  circle  of  radius  I.  It  is  easy  to  find  the  orbital 
velocity  v  from  the  kinetic  energy  T  =  E  —  V .  Since  the  potential  energy 
is  V  =  —\k\/l,  we  must  have 

T  =  whence       v  =  J^.  (4.22) 

21  V  ml  y  ' 

This  can  also  be  found  by  equating  the  attractive  force  \k\/l2  to  the  'cen- 
trifugal force'  mv2/l. 

Note  the  interesting  result  that  for  a  circular  orbit  the  potential  energy 
is  always  twice  as  large  in  magnitude  as  the  kinetic  energy  (see  Problem  21). 

(b)  -§<£<o. 

This  case  is  illustrated  by  the  lower  line  (E)  in  Fig.  4.5.  The  radial 
distance  is  limited  between  a  minimum  distance  r\  and  a  maximum  distance 
T2-  As  we  shall  see,  the  orbit  is  in  fact  an  ellipse. 


82 


Classical  Mechanics 


(c)  E  =  0. 

In  this  case,  there  is  a  minimum  distance,  r\  =  \l,  but  the  maximum 
distance  ri  is  infinite.  Thus  the  particle  has  just  enough  energy  to  escape 
to  infinity,  with  kinetic  energy  tending  to  zero  at  large  distances.  The  orbit 
will  be  shown  to  be  a  parabola. 

(d)  E  >  0. 

This  corresponds  to  the  upper  line  (E')  in  Fig.  4.5.  Again,  there  is  a 
minimum  distance  but  no  maximum  distance.  Now,  however,  the  particle 
can  escape  to  infinity  with  non-zero  limiting  velocity.  The  orbit  is  in  fact  a 
hyperbola. 

As  a  simple  example  of  the  use  of  these  equations,  we  compute  the 
escape  velocity  from  the  Earth. 

Example:  Escape  velocity 

What  is  the  minimum  velocity  with  which  a  projectile  launched 
from  the  surface  of  the  Earth  (taken  to  be  a  sphere  of  mass  M 
and  radius  R)  can  escape,  and  how  does  this  depend  on  the  angle 
of  launch? 

Suppose  the  projectile  is  launched  with  velocity  v  at  an  angle  a  to 
the  vertical.  The  energy  and  angular  momentum  are 

GMm 
~~R 

where  G  is  Newton's  gravitational  constant.  To  express  the  energy  in 
terms  of  more  familiar  quantities,  we  note  that  the  gravitational  force 
on  a  particle  at  the  Earth's  surface  is 

GMm 

mg  =  — , 

from  which  we  obtain  the  useful  result 

GM  =  R2g.  (4.24) 

Thus  E  =  \mv2  —  Rgm.  The  projectile  will  escape  to  infinity  provided 
that  E  >  0;  or  equivalently  that  its  velocity  exceeds  the  escape  velocity, 


E=\mv2  - — ,        J  =  mRvsina,  (4.23) 
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Note  that  this  condition  is  independent  of  the  angle  of  launch  a  (so 
long  as  we  neglect  atmospheric  drag).  Using  the  values  R  =  6370km, 
g  =  9.81ms-2,  we  find  for  the  escape  velocity  from  the  Earth 

ve  =  11.2  kms-1. 

If  the  projectile  is  launched  with  a  speed  less  than  escape  velocity,  it 
will  reach  a  maximum  height  and  then  fall  back.  (This  is  a  generalization 
of  the  problem  discussed  in  §3.2.)  The  problem  of  finding  this  distance 
provides  another  useful  example. 

Example:  Maximum  height  of  projectile 

If  the  launch  speed  v  is  equal  to  the  circular  orbit  speed  vc  in  an 
orbit  just  above  the  Earth's  surface,  how  high  will  the  projectile 
reach  for  a  given  angle  of  launch  a? 

The  maximum  distance  T2  from  the  centre  is  the  larger  root  of  the 
equation  U(r)  =  E,  or,  with  the  values  (4.23)  for  E  and  J, 

(2Rg  -  v2)r2  -  2R2gr  +  R2v2  sin2  a  =  0. 

The  circular  orbit  velocity,  by  (4.22)  and  (4.24),  is 

vc  =  yj  Rg  =  7.9  kms-1. 

Thus  the  equation  reduces  to  r2  —  2Rr  +  R2  sin2  a  =  0,  so  that 

r-i  =  R(l  +  cos  a). 

For  a  vertical  launch,  the  maximum  distance  is  2R.  For  any  other  angle, 
it  is  less,  and  in  the  limit  of  an  almost  horizontal  launch,  the  orbit  is 
nearly  a  circle  of  radius  R. 

This  example  illustrates  the  kind  of  problem  which  may  readily  be  solved 
by  using  the  radial  energy  equation.  It  is  particularly  useful  when  we  are 
interested  only  in  r,  and  not  in  the  polar  angle  0  or  the  time  t. 

Energy  levels  of  the  hydrogen  atom 

This  is  not  a  problem  that  can  be  solved  by  the  methods  of  classical  mechan- 
ics alone.  However,  the  energy  levels  derived  from  quantum  mechanics  can 
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also  be  obtained  by  imposing  on  the  classical  orbits  an  ad  hoc  'quantization 
rule'.  Indeed,  historically,  this  is  how  they  were  first  obtained.  According 
to  Bohr's  'old  quantum  theory',  the  electron  in  an  atom  cannot  occupy  any 
orbit,  but  only  a  certain  discrete  set  of  orbits.  In  the  case  of  circular  orbits, 
the  quantization  rule  he  imposed  was  that  the  angular  momentum  J  be  an 
integer  multiple  of  h  (Planck's  constant  divided  by  2n).  The  constant  k  in 
this  case  is  k  =  —e2/4nea,  where  e  is  the  electric  charge  on  the  proton,  or 
minus  the  charge  on  the  electron.  Thus  for  J  =  nh  the  radius  an  of  the 
orbit  is,  by  (4.21), 

47T60J2  2 

an  =   5—  =  n  ai, 

me1 

where  01,  the  radius  of  the  first  Bohr  orbit  (usually  denoted  by  do),  is 

ai  =  4![eop=  5  29  ><10_llm 
mez 

The  corresponding  energy  levels  are 

En_       e2     _     1  e2 

8-7reoa„        n2  87reoai 

These  values  agree  well  with  the  energies  of  atomic  transitions,  as  deter- 
mined from  the  spectrum  of  hydrogen. 


4.4  Orbits 

We  now  turn  to  the  problem  of  determining  the  orbit  of  a  particle  moving 
under  a  central  conservative  force.  This  can  be  done  by  eliminating  the 
time  from  the  two  conservation  equations,  (4.11),  to  obtain  an  equation 
relating  r  and  6.  The  simplest  way  of  doing  this  is  to  work  not  with  r  itself, 
but  with  the  variable  u  =  1/r,  and  look  for  an  equation  determining  u  as 
a  function  of  6.  Now 

du         1  dr 
d^  =  ~^"de' 


whence 
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Thus  substituting  for  r  in  the  radial  energy  equation  (4.12),  we  obtain 

J2  fdu\2      J2    2     ^      „  /An  ^ 

2^{do)  +V  =  E>  (4'25) 

in  which  of  course  V  is  to  be  regarded  as  a  function  of  1/u.  This  equation 
can  be  integrated  (numerically,  if  not  analytically)  to  give  the  equation  of 
the  orbit. 

We  shall  consider  explicitly  only  the  case  of  the  inverse  square  law,  for 
which  V  =  ku.  We  shall  treat  both  cases  k  >  0  and  k  <  0  together,  by 
writing  V  =  ±\k\u.  It  will  be  useful,  as  in  the  preceding  section,  to  define 
the  length  parameter 

I  =  ^-  (4.26) 

rn\k\ 

Then,  multiplying  (4.25)  by  2/|fc|,  we  obtain 

le)  +lu±2u  =  W 

where  the  upper  sign  refers  to  the  repulsive  case,  k  >  0,  and  the  lower  to 
the  attractive  case,  k  <  0. 

To  solve  this  equation,  we  multiply  both  sides  by  I  and  add  1  to  'com- 
plete the  square'.  We  now  introduce  the  new  variable 

,  ,  dz  d« 

z  =  lu±l,        so  that       —  =  I  —  . 

dO  dO 

Then  we  may  write  the  equation  as 

dz\  2      2      2EI  2  ,„  . 

d^J    +Z  =W  +  1  =  6'  (4'27) 

say,  defining  a  new  dimensionless  constant  e.  Note  that  since  the  left  hand 
side  is  a  sum  of  squares,  it  can  have  a  solution  only  when  the  the  right  hand 
side  is  positive,  in  agreement  with  our  earlier  result  that  the  minimum  value 
of  E  is  -\k\/2l. 

The  general  solution  of  this  equation  is 

2  =  lu  ±  1  =  e  cos(#  —  9q), 


where  6q  is  an  arbitrary  constant  of  integration.  Thus  finally,  replacing 
u  by  1/r  and  multiplying  by  r,  we  find  that  the  orbit  equation  is,  in  the 
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repulsive  case 


r[e  cos(# 


e0)  -i]  =  i, 


{k  >  0) 


(4.28) 


and  in  the  attractive  case 


r[e  cos(0 


Go)  +  1]  =  I. 


(k  <  0) 


(4.29) 


These  are  the  polar  equations  of  conic  sections,  referred  to  a  focus  as  origin 
(see  Appendix  B).  The  constant  e,  the  eccentricity,  determines  the  shape 
of  the  orbit;  I,  called  the  semi-latus  rectum,  determines  its  scale;  and  9q  its 
orientation  relative  to  the  coordinate  axes. 

In  the  repulsive  case,  e  must  be  greater  than  unity,  since  otherwise  the 
square  bracket  in  (4.28)  is  always  negative.  Therefore,  by  (4.27),  E  must 
be  positive. 

In  the  attractive  case,  e  =  0  when  E  has  its  minimum  value,  —  \k\/2l; 
the  orbit  is  then  the  circle  r  =  I.  So  long  as  e  <  1,  or  E  <  0,  the  square 
bracket  in  (4.29)  is  always  positive,  and  there  is  a  value  of  r  for  every  9. 
Thus  the  orbit  is  closed.  When  e  >  1,  or  E  >  0,  r  can  become  infinite 
when  the  square  bracket  vanishes.  This  is  in  agreement  with  our  previous 
conclusion  that  the  particle  will  escape  to  infinity  if  and  only  if  its  total 
energy  is  positive. 

Note  that  r  takes  its  minimum  value  when  9  =  6q.  Thus  9q  specifies 
the  direction  of  the  point  of  closest  approach.  In  the  attractive  case,  the 
constant  I  also  has  a  simple  geometrical  meaning.  It  is  the  radial  distance 
at  right  angles  to  this  direction;  that  is,  r  =  1  when  9  =  6o  ±  ^tt. 

Elliptic  orbits  (E  <  0,  0  <  e  <  1) 

For  most  applications,  it  is  better  to  use  the  orbit  equation  directly  in 
polar  form.  It  is,  however,  straightforward  to  rewrite  it  in  the  possibly 
more  familiar  Cartesian  form  (see  Appendix  B).  If  we  choose  the  axes  so 
that  8q  =  0,  we  obtain  after  some  algebra 


(x  +  aef 

a?       +  62 


where 
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This  is  the  equation  of  an  ellipse  with  centre  at  (— oe,0),  and  semi-axes 
a  and  b.  (See  Fig.  4.6)  It  is  useful  to  note  that  the  semi-major  axis  a  is 


Fig.  4.6 

determined  by  the  value  of  the  energy  E,  while  the  semi-latus  rectum  I  is 
fixed  by  the  angular  momentum  J. 

The  time  taken  by  the  particle  to  traverse  any  part  of  its  orbit  may 
be  found  from  the  relation  (3.26)  between  angular  momentum  and  rate  of 
sweeping  out  area, 

dA_J_ 
~dt  ~  2m' 

All  we  have  to  do  is  to  evaluate  the  area  swept  out  by  the  radius  vector,  and 
multiply  by  2m/ J.  In  particular,  since  the  area  of  the  ellipse  is  A  =  nab, 
the  orbital  period  is  r  =  2mirab/ 'J '.  Thus  by  (4.26)  and  (4.30), 

/  t  \2      rn2a2b2  m2a3l 
\2tt)  J2  m\k\l  ' 

and  thus 


(4.31) 


(It  is  easy  to  verify  directly  from  (4.22)  that  this  gives  the  correct  period 
for  a  circular  orbit  of  radius  a.)  For  a  planet  or  satellite  orbiting  a  central 
body  of  mass  M, 

(5)'-J5f  ^ 

This  yields  Kepler's  third  law  of  planetary  motion:  the  square  of  the  orbital 
period  is  proportional  to  the  cube  of  the  semi-major  axis. 
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Example:  Time  spent  in  the  two  halves  of  Earth's  orbit 

If  the  Earth's  orbit  is  divided  in  two  by  its  minor  axis,  how  much 
longer  does  it  spend  in  one  half  than  the  other? 

The  eccentricity  of  Earth's  orbit  is  e  =  0.0167.  It  is  clear  from 
Fig.  4.6  that  the  areas  swept  out  by  the  position  vector  of  the  Earth  in 
the  two  halves  are  equal  to  half  the  area  of  the  ellipse  ±  the  area  of  a 
triangle  with  base  26  and  height  ae,  namely  ^Tiab±aeb.  Thus  the  times 
taken  are  (|  ±  e/n)  years.  The  difference  between  the  two  is  2e/n  years, 
or  3.88  days. 


Hyperbolic  orbits  (E  >  0,  e  >  1) 

For  both  the  repulsive  and  attractive  cases,  the  Cartesian  equation  of  the 
orbit  is 


(x  -  ae)2  y2 
a2  ¥ 


where 


e2  -  1  2E 


x-^  and 


al 


I2 


2mE 


(4.33) 


This  is  the  equation  of  a  hyperbola  with  centre  at  (+ae,  0)  and  semi-axes  a 
and  b.  (See  Fig.  4.7.)  One  branch  of  the  hyperbola  (on  the  left  in  the  figure) 
corresponds  to  the  orbit  in  the  attractive  case  and  the  other  to  the  orbit 
in  the  repulsive  case.  As  before,  a  is  determined  by  the  energy  E,  and  I  by 


Fig.  4.7 


Central  Conservative  Forces 


89 


the  angular  momentum  J.  Note  that  by  (4.17)  and  (4.18),  the  semi-minor 
axis  b  is  identical  with  the  impact  parameter  introduced  earlier. 

The  directions  in  which  r  becomes  infinite  are,  in  the  repulsive  case, 
9  =  ±  arccos(l/e),  and,  in  the  attractive  case,  6  =  ±[ir  —  arccos(l/e)].  In 
both  cases,  therefore,  the  angle  through  which  the  particle  is  deflected  from 
its  original  direction  of  motion  is 

6  =  vr  -  2  arccos(l/e).  (4.34) 

This  angle  6  is  called  the  scattering  angle.  For  later  use,  it  will  be  helpful 
to  find  the  relation  between  the  impact  parameter  6,  the  scattering  angle  O, 
and  the  limiting  velocity  v  at  infinity.  From  (4.34),  we  have  e  =  sec  ^(ir— 0). 
Substituting  in  (4.33),  this  gives 

b2  =  a2(e2  -  1)  =  a2  [sec2  ±(tt-  9)  -  1]  =  a2  cot2  §0. 

But  also  from  (4.33),  using  E  =  |mv2,  we  have  a  =  \k\/mv2.  Thus  we 
obtain 

6=^Lcoti6.  (4.35) 
mv2  2 


Let  us  complete  this  section  by  considering  a  rather  special  orbit. 

Example:  Orbit  of  a  comet 

The  minimum  distance  of  a  comet  from  the  Sun  is  half  the  radius 
of  the  Earth's  orbit  (assumed  circular),  and  its  velocity  at  that 
point  is  twice  the  orbital  velocity  of  the  Earth.  Find  its  velocity 
when  it  crosses  the  Earth's  orbit,  and  the  angle  at  which  the 
orbits  cross.  Will  the  comet  subsequently  escape  from  the  solar 
system?  What  kind  of  orbit  does  it  follow? 

At  perihelion,  r  =  oe/2  and  v  =  2uCje-  Hence  the  energy  is 

„     „     2  2GMsm 

E  =  2mv2E  —  =  o, 

«E 

using  (4.22).  It  follows  that  the  comet's  velocity  v  when  it  crosses  the 
Earth's  orbit  is  given  by 

,      2     GM^m  „ 
»m»  =  =mvcE, 

fiE 
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whence  v  =  y/2vCtE-  The  angular  momentum  is  J  =  moEiic,E,  whence 
by  (4.26),  I  =  Oe-  Thus  the  angle  a  at  which  the  orbits  cross  is  given  by 

maEWcosa  =  J  =  maEUc,E, 

which  yields  a  =  45°. 

Since  E  =  0,  the  comet  has  just  enough  energy  to  escape  from  the 
solar  system.  Its  orbit  is  a  parabola. 

4.5    Scattering  Cross-sections 

One  of  the  most  important  ways  of  obtaining  information  about  the  struc- 
ture of  small  bodies  (for  example  atomic  nuclei)  is  to  bombard  them  with 
particles  and  measure  the  number  of  particles  scattered  in  various  direc- 
tions. The  angular  distribution  of  scattered  particles  will  depend  on  the 
shape  of  the  target  and  on  the  nature  of  the  forces  between  the  particles 
and  the  target.  To  be  able  to  interpret  the  results  of  such  an  experiment, 
we  must  know  how  to  calculate  the  expected  angular  distribution  when  the 
forces  are  known. 

We  shall  consider  first  a  particularly  simple  case.  We  suppose  that  the 
target  is  a  fixed,  hard  (that  is,  perfectly  elastic)  sphere  of  radius  R,  and 
that  a  uniform,  parallel  beam  of  particles  impinges  on  it.  Let  the  particle 
flux  in  the  beam,  that  is,  the  number  of  particles  crossing  unit  area  normal 
to  the  beam  direction  per  unit  time,  be  /.  Then  the  number  of  particles 
which  strike  the  target  in  unit  time  is 

w  =  fa,  (4.36) 

where  a  is  the  cross-sectional  area  presented  by  the  target,  namely 

ct  =  ttR2.  (4.37) 

Now  let  us  consider  one  of  these  particles.  We  suppose  that  it  impinges 
on  the  target  with  velocity  v  and  impact  parameter  b.  Then,  as  is  clear 
from  Fig.  4.8,  it  will  hit  the  target  at  an  angle  a  to  the  normal  given  by 

b  =  Rs'ma. 

The  force  on  the  particle  is  an  impulsive  central  conservative  force,  corre- 
sponding to  a  potential  V(r)  which  is  zero  for  r  >  R,  and  rises  very  sharply 
in  the  neighbourhood  of  r  =  R.  Thus  the  kinetic  energy  and  angular  mo- 
mentum must  be  the  same  before  and  after  the  collision.  Let  us  take  the 
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Fig.  4.8 

positive  z  direction  (0  =  0)  to  be  the  direction  of  motion  of  the  incoming 
particles.  Then,  by  the  axial  symmetry  of  the  problem,  the  particle  must 
move  in  a  plane  ip  =  constant.  From  energy  conservation,  its  velocity  must 
be  the  same  in  magnitude  before  and  after  the  collision.  Then,  from  an- 
gular momentum  conservation,  it  follows  that  the  particle  will  bounce  off 
the  sphere  at  an  angle  to  the  normal  equal  to  the  incident  angle  a.  Thus 
the  particle  is  deflected  through  an  angle  6  =  n  —  2a,  related  to  the  impact 
parameter  by 

b  =  Rcos\9.  (4.38) 

We  can  now  calculate  the  number  of  particles  scattered  in  a  direction 
specified  by  the  polar  angles  6,  ip,  within  angular  ranges  dO,  dip.  The  parti- 
cles scattered  through  angles  between  9  and  9  +  dO  are  those  that  came  in 
with  impact  parameters  between  b  and  b  +  db,  where 

db  =  -±Rsm±6d6. 

(Note  that  in  our  case  db  is  actually  negative.) 

Consider  now  a  cross-section  of  the  incoming  beam.  The  particles  we 
are  interested  in  are  those  which  cross  a  small  element  of  area 


da  =  b  \db\  dip. 


(4.39) 
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Fig.  4.9 

(See  Fig.  4.9.)  Inserting  the  values  of  b  and  db,  we  find 

do-  =  \R2  sin6  AO  dtp.  (4.40) 

The  rate  at  which  particles  cross  this  area,  and  therefore  the  rate  at  which 
they  emerge  in  the  given  angular  range,  is 

dw  =  /da.  (4.41) 

In  order  to  measure  this  rate,  we  may  place  a  small  detector  at  a  large 
distance  from  the  target  in  the  specified  direction.  We  therefore  want  to 
express  our  result  in  terms  of  the  cross-sectional  area  dA  of  the  detector, 
and  its  distance  L  from  the  target  (which  we  assume  to  be  much  greater 
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than  R).  Now  from  the  discussion  of  §3.5  (see  the  paragraph  preceding 
(3.30)),  we  see  that  the  element  of  area  on  a  sphere  of  radius  L  is 

dA  =  Ld6  x  Lsin9d(p. 

We  define  the  solid  angle  subtended  at  the  origin  by  the  area  d^4  to  be 

dft  =  sin6ld6>d(/>,  (4.42) 

so  that 

dA  =  L2dU.  (4.43) 

The  solid  angle  is  measured  in  steradians  (sr).  It  plays  the  same  role  for  a 
sphere  as  does  the  angle  in  radians  for  a  circle;  Eq.  (4.43)  is  the  analogue 
of  the  equation  ds  =  Ld6  for  a  circle  of  radius  L.  Just  as  the  total  angle 
subtended  by  an  entire  circle  is  2tv,  so  the  total  solid  angle  subtended  by 
an  entire  sphere  is 

IJ  dil=  ^  JJ  dA  =       sin0d0  J    dip  =  4ir. 

The  important  quantity  is  not  the  cross-sectional  area  da  itself,  but  rather 
the  ratio  da/dU,  which  is  called  the  differential  cross-section.  By  (4.41) 
and  (4.43),  the  rate  dw  at  which  particles  enter  the  detector  is 

It  is  useful  to  note  an  alternative  definition  of  the  differential  cross- 
section,  which  is  applicable  even  if  we  cannot  follow  the  trajectory  of  each 
individual  particle,  and  therefore  cannot  say  just  which  of  the  incoming 
particles  are  those  that  emerge  in  a  particular  direction.  We  may  define 
da/dU  to  be  the  ratio  of  scattered  particles  per  unit  solid  angle  to  the 
number  of  incoming  particles  per  unit  area.  Then  the  rate  at  which  particles 
are  detected  is  obtained  by  multiplying  the  differential  cross-section  by  the 
flux  of  incoming  particles,  and  by  the  solid  angle  subtended  at  the  target 
by  the  detector,  as  in  (4.44).  Note  that  da/dil  has  the  dimensions  of  area 
(solid  angle,  like  angle,  is  dimensionless) ;  it  is  measured  in  square  metres 
per  steradian  (m2  sr-1). 

In  the  particular  case  of  scattering  from  a  hard  sphere,  the  differential 
cross-section  is,  by  (4.40)  and  (4.42), 

§5  =  **-  <"5> 
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It  has  the  special  feature  of  being  isotropic,  or  independent  of  the  scattering 
angle.  Thus  the  rate  at  which  particles  enter  the  detector  is,  in  this  case, 
independent  of  the  direction  in  which  it  is  placed.  We  note  that  the  total 
cross-section  (4.37)  is  correctly  given  by  integrating  (4.45)  over  all  solid 
angles;  in  this  case,  we  have  merely  to  multiply  by  the  total  solid  angle,  Aty. 

4.6    Mean  Free  Path 

The  total  cross-section  a  is  useful  in  discussing  the  attenuation  of  a  beam 
of  particles  passing  through  matter.  Let  us  consider  first  a  particle  moving 
through  a  material  containing  n  atoms  per  unit  volume,  and  suppose  that 
the  total  cross-section  for  scattering  by  a  single  atom  is  a.  Consider  a 
cylinder  whose  axis  is  the  line  of  motion  of  the  particle,  and  whose  cross- 
sectional  area  is  a.  Then  the  particle  will  collide  with  any  atoms  whose 
centres  lie  within  the  cylinder.  Now,  the  number  of  such  atoms  in  a  length 
x  of  the  cylinder  is  nax.  This  is  therefore  the  average  number  of  collisions 
made  by  the  particle  when  it  travels  a  distance  x.  Thus  the  mean  distance 
travelled  between  collisions  —  the  mean  free  path  A  —  is  x/nax,  i.e. 


Now  consider  a  beam  of  particles,  with  flux  /,  impinging  normally  on 
a  wall.  How  far  will  they  penetrate?  We  need  to  calculate  the  flux  of 
particles  that  penetrate  to  a  depth  x  without  suffering  a  collision.  Let  this 
flux  be  f(x),  and  consider  a  thin  slice  of  wall  of  thickness  dx  and  area  A  at 
the  depth  x  (see  Fig.  4.10).  The  rate  at  which  unscattered  particles  enter 
the  slice  is  Af(x),  and  the  rate  at  which  they  emerge  on  the  other  side  is 
Af(x  +  dx).  The  difference  between  the  two  must  be  the  rate  at  which 
collisions  occur  within  the  slice.  Now  the  total  number  of  atoms  in  the 
slice  is  nAdx,  and  the  total  cross-sectional  area  presented  by  all  of  them 
is  crnAdx  (assuming  dx  is  small  enough  that  none  of  them  overlap).  Thus 
the  rate  at  which  collisions  occur  in  the  slice  is  f(x)anA  dx.  Equating  the 
two  quantities,  we  have 


(4.46) 


Af(x)-Af(x  +  dx) 


f(x)anAdx  =  f(x)—dx, 


by  (4.46).  Equivalently, 


df(x) 


dx 
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Fig.  4.10 

This  equation  may  be  integrated  to  give 

fix)  =  Ce~x'\ 
where  C  is  an  arbitrary  constant.  But  /(0)  =  /,  so  finally 

f{x)  =  fe~*'\  (4.47) 

Thus  the  number  of  particles  in  the  beam  decreases  by  a  factor  of  1/e  in 
one  mean  free  path. 

Very  often,  when  we  wish  to  study  the  structure  of  a  small  object,  like 
an  atom,  it  would  be  quite  impractical  to  use  a  target  consisting  of  only 
a  single  atom.  Instead,  we  have  to  use  a  target  containing  a  large  number 
N  of  atoms.  If  the  thickness  of  the  target  is  x,  the  rate  at  which  collisions 
occur  within  the  target  will  be 

Af(0)  -  A f{x)  =  Ail  -  e~x'x)f. 

If  x  is  small  compared  to  the  mean  free  path,  then  we  may  retain  only 
the  linear  term  in  the  expansion  of  the  exponential.  In  this  case,  the  rate 
is  approximately  Axf/X  =  Axncrf.  Since  N  =  nAx,  this  shows  that  for 
a  thin  target  the  number  of  particles  scattered  will  be  just  N  times  the 
number  scattered  by  a  single  atom.  For  a  thick  target,  it  would  be  less, 
because  the  atoms  would  effectively  screen  each  other. 

When  the  target  is  thin,  the  probability  of  multiple  collisions,  in  which 
a  particle  strikes  several  target  atoms  in  succession,  will  be  small.  If  we 
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assume  that  it  is  negligible,  then  we  can  conclude  that  the  angular  distribu- 
tion of  scattered  particles  will  also  be  the  same  as  that  from  a  single  target 
atom.  This  will  be  the  case  if  all  the  dimensions  of  the  target  are  small  in 
comparison  with  A.  If  we  use  a  detector  whose  distance  from  the  target,  L, 
is  large  compared  to  the  target  size  (so  that  the  scattering  angle  does  not 
depend  appreciably  on  which  of  the  atoms  in  the  target  is  struck),  then  the 
rate  at  which  particles  enter  the  detector  will  be 

that  is,  just  N  times  the  rate  for  a  single  target  atom. 


4.7    Rutherford  Scattering 

We  discuss  in  this  section  a  problem  which  was  of  crucial  importance  in 
obtaining  an  understanding  of  the  structure  of  the  atom.  In  a  classic  ex- 
periment, performed  in  1911,  Rutherford  bombarded  atoms  with  a-particles 
(helium  nuclei).  Because  these  particles  are  much  heavier  than  electrons, 
they  are  deflected  only  very  slightly  by  the  electrons  in  the  atom  (see  Chap- 
ter 7),  and  can  therefore  be  used  to  study  the  heavy  atomic  nucleus.  From 
observations  of  the  angular  distribution  of  the  scattered  a-particles,  Ruther- 
ford was  able  to  show  that  the  law  of  force  between  a-particle  and  nucleus 
is  the  inverse  square  law  down  to  very  small  distances.  Thus  he  concluded 
that  the  positive  charge  is  concentrated  in  a  very  small  nuclear  volume 
rather  than  being  spread  out  over  the  whole  volume  of  the  atom. 

Let  us  now  calculate  the  differential  cross-section  for  the  scattering  of 
a  particle  of  charge  q  and  mass  m  by  a  fixed  point  charge  q' .  The  impact 
parameter  b  is  related  to  the  scattering  angle  0  by  (4.35),  i.e. 

b  =  acotU,       a=  -   qq    0.  (4.49) 


Thus 

db 


ad9 


2  sin2  ±0' 


so  that,  substituting  in  (4.39),  we  obtain 

da 


a2  cos  |0d0d<p 


2  sin3  ±0 
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Dividing  by  the  solid  angle  (4.42),  we  find  for  the  differential  cross-section 

dcr  a2  , ,  „x 

dn  =  (4-50) 

This  is  the  Rutherford  scattering  cross-section. 

We  note  that,  in  contrast  to  the  differential  cross-section  for  hard-sphere 
scattering,  this  cross-section  is  strongly  dependent  both  on  the  velocity  of 
the  incoming  particle  and  on  the  scattering  angle.  It  also  increases  rapidly 
with  increasing  charge.  For  scattering  of  an  a-particle  on  a  nucleus  of 
atomic  number  Z,  qq'  =  2Ze2  (where  — e  is  the  electronic  charge).  Thus  we 
expect  the  number  of  particles  scattered  to  increase  like  Z2  with  increasing 
atomic  number. 

We  saw  in  §4.3  that  the  minimum  distance  of  approach  is  given  by 
(4.20).  Thus  to  investigate  the  structure  of  the  atom  at  small  distances,  we 
must  use  high-velocity  particles,  for  which  a  is  small,  and  examine  large- 
angle  scattering,  corresponding  to  particles  with  small  impact  parameter 
b.  The  cross-section  (4.50)  is  large  for  small  values  of  the  scattering  angle, 
but  physically  it  is  the  large-angle  scattering  which  is  of  most  interest. 
For,  the  fact  that  particles  can  be  scattered  through  large  angles  is  an 
indication  that  there  are  very  strong  forces  acting  at  short  distances.  If  the 
positive  nuclear  charge  were  spread  out  over  a  large  volume,  the  force  would 
be  inverse-square-law  only  down  to  the  radius  of  the  charge  distribution. 
Beyond  that  point,  it  would  decrease  as  we  go  to  even  smaller  distances. 
Consequently,  the  particles  that  penetrate  to  within  this  distance  would 
experience  a  weaker  force  than  the  inverse  square  law  predicts,  and  would 
be  scattered  through  smaller  angles. 

A  peculiar  feature  of  the  differential  cross-section  (4.50)  is  that  the 
corresponding  total  cross-section  is  infinite.  This  is  a  consequence  of  the 
infinite  range  of  the  Coulomb  force.  However  far  away  from  the  nucleus  a 
particle  may  be,  it  still  experiences  some  force,  and  is  scattered  through  a 
non-zero  (though  small)  angle.  Thus  the  total  number  of  particles  scattered 
through  any  angle,  however  small,  is  indeed  infinite.  We  can  easily  calculate 
the  number  of  particles  scattered  through  any  angle  greater  than  some  small 
lower  limit  9q.  These  are  the  particles  which  had  impact  parameters  b  less 
than  bo  =  a  cot  ^9o-  The  corresponding  cross-section  is,  therefore, 


(6>  >  0O)  =  tt&o  =  ™  cot  h°o-  (4-51) 


(This  may  also  be  verified  by  direct  integration  of  (4.50).) 
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4.8  Summary 

For  a  particle  moving  under  any  central,  conservative  force,  information 
about  the  radial  motion  may  be  obtained  from  the  radial  energy  equation, 
which  results  from  eliminating  9  between  the  conservation  equations  for 
energy  and  angular  momentum.  The  values  of  E  and  J  can  be  determined 
from  the  initial  conditions,  and  this  equation  then  tells  us  the  radial  velocity 
at  any  value  of  r. 

When  information  about  the  angle  9  is  needed,  we  must  find  the  equa- 
tion of  the  orbit.  For  the  inverse  square  law,  the  orbit  is  an  ellipse  or  a 
hyperbola,  according  as  E  <  0  or  E  >  0.  The  semi-major  axis  is  fixed  by 
E,  and  the  semi- latus  rectum  by  J. 

If  we  are  concerned  with  finding  the  time  taken  to  traverse  part  of  the 
orbit,  we  can  use  the  relation  between  the  angular  momentum  and  the  rate 
of  sweeping  out  area. 

When  a  beam  of  particles  strikes  a  target,  the  angular  distribution  of 
scattered  particles  may  be  found  from  the  differential  cross-section  da/dfl. 
This  may  be  calculated  from  a  knowledge  of  the  relation  between  the  scat- 
tering angle  and  the  impact  parameter.  The  attenuation  of  the  beam  is 
related  to  the  total  cross-section  a,  obtained  by  integrating  d<r/dfi  over  all 
solid  angles. 


Problems 

1.  The  orbits  of  synchronous  communications  satellites  have  been  chosen 
so  that  viewed  from  the  Earth  they  appear  to  be  stationary.  Find  the 
radius  of  the  orbits. 

2.  Find  the  radii  of  synchronous  orbits  about  Jupiter  and  about  the  Sun. 
[Their  mean  rotation  periods  are  10  hours  and  27  days,  respectively. 
The  mass  of  Jupiter  is  318  times  that  of  the  Earth.  The  semi-major 
axis  of  the  Earth's  orbit,  or  astronomical  unit  (au)  is  1.50  x  108km.] 

3.  The  semi-major  axis  of  Jupiter's  orbit  is  5.20  AU.  Find  its  orbital  period 
in  years,  and  its  mean  (time-averaged)  orbital  speed.  (Mean  orbital 
speed  of  Earth  =  29.8  kms-1.) 

4.  The  orbit  of  an  asteroid  extends  from  the  Earth's  to  Jupiter's,  just 
touching  both.  Find  its  orbital  period.  (Treat  the  planetary  orbits  as 
circular  and  coplanar.) 
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5.  Find  the  maximum  and  minimum  orbital  speeds  of  the  asteroid  in  Prob- 
lem 4. 

6.  The  Moon's  mass  and  radius  are  0.0123  ME  and  0.273  RE  (E  =  Earth). 
For  Jupiter  the  corresponding  figures  are  318  Me  and  11.0  Re-  Find  in 
each  case  the  gravitational  acceleration  at  the  surface,  and  the  escape 
velocity. 

7.  Calculate  the  period  of  a  satellite  in  an  orbit  just  above  the  Earth's 
atmosphere  (whose  thickness  may  be  neglected).  Find  also  the  periods 
for  close  orbits  around  the  Moon  and  Jupiter. 

8.  The  Sun  has  an  orbital  speed  of  about  220kms_1  around  the  centre 
of  the  Galaxy,  whose  distance  is  28  000  light  years.  Estimate  the  total 
mass  of  the  Galaxy  in  solar  masses. 

9.  A  particle  of  mass  m  moves  under  the  action  of  a  harmonic  oscillator 
force  with  potential  energy  |fcr2.  Initially,  it  is  moving  in  a  circle  of 
radius  a.  Find  the  orbital  speed  v.  It  is  then  given  a  blow  of  impulse 
mv  in  a  direction  making  an  angle  a  with  its  original  velocity.  Use  the 
conservation  laws  to  determine  the  minimum  and  maximum  distances 
from  the  origin  during  the  subsequent  motion.  Explain  your  results 
physically  for  the  two  limiting  cases  a  =  0  and  a  =  n. 

10.  Write  down  the  effective  potential  energy  function  U(r)  for  the  system 
described  in  Chapter  3,  Problem  11.  Initially,  the  particle  is  moving 
in  a  circular  orbit  of  radius  2a.  Find  the  orbital  angular  velocity  u  in 
terms  of  the  natural  angular  frequency  loo  of  the  oscillator  when  not 
rotating.  If  the  motion  is  lightly  disturbed,  the  particle  will  execute 
small  oscillations  about  the  circular  orbit.  By  considering  the  effec- 
tive potential  energy  function  U(r)  near  its  minimum,  find  the  angular 
frequency  u)'  of  small  oscillations.  Hence  describe  the  disturbed  orbit 
qualitatively. 

11.  Show  that  the  comet  discussed  at  the  end  of  §4.4  crosses  the  Earth's 
orbit  at  opposite  ends  of  a  diameter.  Find  the  time  it  spends  inside  the 
Earth's  orbit.  (To  evaluate  the  area  required,  write  the  equation  of  the 
orbit  in  Cartesian  co-ordinates.  See  Appendix  B.) 

12.  A  star  of  mass  M  and  radius  R  is  moving  with  velocity  v  through  a 
cloud  of  particles  of  density  p.  If  all  the  particles  that  collide  with  the 
star  are  trapped  by  it,  show  that  the  mass  of  the  star  will  increase  at 
a  rate 
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Given  that  M  =  1031  kg  and  R  =  108  km,  find  how  the  effective  cross- 
sectional  area  compares  with  the  geometric  cross-section  tvR2  for  ve- 
locities of  1000  kms-1,  100  kms-1  and  lOkrns-1. 

13.  *Find  the  polar  equation  of  the  orbit  of  an  isotropic  harmonic  oscillator 
by  solving  the  differential  equation  (4.25),  and  verify  that  it  is  an  ellipse 
with  centre  at  the  origin.  (Hint:  Change  to  the  variable  v  =  u2 .)  Check 
also  that  the  period  is  given  correctly  by  r  =  2mA/ J. 

14.  Discuss  qualitatively  the  orbits  of  a  particle  under  a  repulsive  force 
with  potential  energy  function  V  =  ^kr2  where  k  is  negative,  using  the 
effective  potential  energy  function  U.  How  would  the  orbit  equation, 
as  found  in  Problem  13,  differ  in  this  case?  What  shape  is  the  orbit? 

15.  *If  the  Earth's  orbit  is  divided  in  two  by  the  latus  rectum,  show  that 
the  difference  in  time  spent  in  the  two  halves,  in  years,  is 

—  \  e\J  1  —  e2  +  arcsinej  , 

and  hence  for  small  e  about  twice  as  large  as  the  difference  computed 
in  the  example  in  §4.4.  (Hint:  Use  Cartesian  co-ordinates  to  evaluate 
the  required  area.  The  identity  ty/2  —  arcsin  \f\  —  e2  =  arcsine  may  be 
useful.) 

16.  A  spacecraft  is  to  travel  from  Earth  to  Jupiter  along  an  elliptical  orbit 
that  just  touches  each  of  the  planetary  orbits  (i.e.,  the  orbit  of  the 
asteroid  in  Problem  4).  Use  the  results  of  Problems  3,  4  and  5  to 
find  the  relative  velocity  of  the  spacecraft  with  respect  to  the  Earth 
just  after  launching  and  that  with  respect  to  Jupiter  when  it  nears 
that  planet,  neglecting  in  each  case  the  gravitational  attraction  of  the 
planet.  Where  in  its  orbit  must  Jupiter  be  at  the  time  of  launch,  relative 
to  the  Earth?  Where  will  the  Earth  be  when  it  arrives? 

[This  semi-elliptical  trajectory  is  known  as  a  Hohmann  transfer  and  it 
is  energy-efficient  for  interplanetary  travel  using  high-thrust  rockets  in 
that  a  discrete  boost  is  required  at  the  beginning  and  at  the  end  of  the 
journey  in  order  respectively  to  leave  Earth  and  to  arrive  at  the  target 
planet.  A  similarly  careful  choice  of  timing  for  initiating  a  return  to 
Earth  is  necessary,  so  that  there  is  an  inevitable  minimum  time  that 
must  be  spent  in  the  region  of  (here)  Jupiter  before  this  can  be  done.  It 
is  an  interesting  and  important  exercise  to  compare  timings  for  round 
trips  of  planetary  exploration  using  this  form  of  transfer.  For  Mars,  an 
obvious  target  in  the  short  term,  the  time  required  is  about  32  Earth 
months  with  about  15  months  spent  at  Mars.  There  are  of  course  other 
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ways  of  effecting  transfer,  but  at  rather  greater  cost,  either  in  fuel  or 
in  time  spent  at  the  destination!] 

17.  *Suppose  that  the  asteroid  of  Problems  4  and  5  approaches  the  Earth 
with  an  impact  parameter  of  5Re,  where  i?E  =  Earth's  radius,  moving 
in  the  same  plane  and  overtaking  it.  (This  is  an  improbably  close 
encounter  for  a  large  asteroid;  however  the  spectacular  impact  of  comet 
Shoemaker-Levy  9  with  Jupiter  in  July  1994  should  prevent  us  from 
being  too  complacent  about  this  threat!)  Find  the  distance  of  closest 
approach  and  the  angle  through  which  the  asteroid  is  scattered,  in  the 
frame  of  reference  in  which  the  Earth  is  at  rest.  (Assume  that  the 
asteroid  is  small  enough  to  have  negligible  effect  on  the  Earth's  orbit.) 
What  is  its  new  velocity  v  relative  to  the  Sun?  Show  that  the  semi- 
major  axis  of  its  new  orbit  is  ciEVE/(2t;E  —  v2),  where  <ze  and  i>e  are 
the  Earth's  orbital  radius  and  orbital  velocity.  Find  the  asteroid's  new 
orbital  period. 

18.  *Show  that  the  position  of  a  planet  in  its  elliptical  orbit  can  be  ex- 
pressed, using  a  frame  with  a;- axis  in  the  direction  of  perihelion  (point 
of  closest  approach  to  the  Sun) ,  in  terms  of  an  angular  parameter  ip  by 
x  =  a(costp  —  e),  y  =  bsinip.  (See  Problem  B.l.  In  the  literature,  ip  is 
sometimes  called  the  eccentric  anomaly,  while  the  polar  angle  9  is  the 
true  anomaly.)  Show  that  r  =  a(l  —  ecos?/'),  and  that  the  time  from 
perihelion  is  given  by  t  =  (t/2it)(iP  —  esini/>)  (Kepler's  equation). 

19.  Use  the  parametrization  of  Problem  18  to  calculate  the  time-averaged 
values  of  the  kinetic  and  potential  energies  T  and  V  over  a  complete 
period.  Hence  verify  the  virial  theorem,  (U)av  =  —  2(T)av. 

20.  *Find  a  parametrization  similar  to  that  of  Problem  18  for  a  hyperbolic 
orbit,  using  hyperbolic  functions. 

21.  On  reaching  the  vicinity  of  Jupiter,  the  spacecraft  in  Problem  16  is 
swung  around  the  planet  by  its  gravitational  attraction  —  a  'sling- 
shot' manoeuvre.  Consider  this  encounter  in  the  frame  of  reference  in 
which  Jupiter  is  at  rest.  What  is  the  magnitude  and  direction  of  the 
spacecraft's  velocity  before  scattering?  What  is  its  magnitude  after 
scattering?  If  the  scattering  angle  in  this  frame  is  90°,  what  must  be 
the  impact  parameter?  What  is  the  distance  of  closest  approach  to  the 
planet,  in  terms  of  Jupiter  radii?  (Mj  =  318ME,  Rj  =  11.0i?E-) 

22.  *If  the  manoeuvre  in  Problem  21  is  in  the  orbital  plane,  so  that  the 
final  velocity  of  the  spacecraft  relative  to  Jupiter  is  radially  away  from 
the  Sun,  what  is  its  velocity  in  magnitude  and  direction  relative  to 
the  Sun?  Use  the  radial  energy  equation  to  determine  the  spacecraft's 
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farthest  distance  from  the  Sun  (its  aphelion  distance)  in  astronomical 
units.  Find  also  its  new  orbital  period.  When  it  returns,  what  will  be 
its  perihelion  distance? 

23.  An  alternative  to  the  manoeuvre  described  in  Problem  22  is  for  the 
spacecraft  to  be  scattered  out  of  the  orbital  plane.  Assume  that  rel- 
ative to  Jupiter  its  velocity  after  scattering  is  directed  normal  to  the 
orbital  plane.  What  is  its  velocity  relative  to  the  Sun?  What  will  be  its 
aphelion  distance  and  orbital  period?  How  far  from  the  orbital  plane 
will  it  reach?  (Hint:  Immediately  after  scattering,  the  radial  compo- 
nent of  its  velocity  is  zero.  This  is  therefore  the  perihelion  point  of  the 
new  orbit.  The  farthest  point  from  the  original  orbital  plane  will  occur 
when  it  is  at  one  end  of  the  semi-minor  axis  of  the  orbit.) 

24.  *A  ballistic  rocket  (one  that  moves  freely  under  gravity  after  its  initial 
launch)  is  fired  from  the  surface  of  the  Earth  with  velocity  v  <  \[Rg 
at  an  angle  a  to  the  vertical.  (Ignore  the  Earth's  rotation.)  Find 
the  equation  of  its  orbit.  Express  the  range  2R9  (measured  along  the 
Earth's  surface)  in  terms  of  the  parameters  I  and  a,  and  hence  show 
that  to  maximize  the  range,  we  should  choose  a  so  that  I  =  2a  —  R. 
(Hint:  A  sketch  may  help.)  Deduce  that  the  maximum  range  is  2R6 
where  sin#  =  v2/(2Rg  —  v2).  Given  that  the  maximum  range  is  3600 
nautical  miles,  find  the  launch  velocity  and  the  angle  at  which  the 
rocket  should  be  launched.  (Note:  1  nautical  mile  =  1  minute  of  arc 
over  the  Earth's  surface.) 

25.  Discuss  the  possible  types  of  orbit  for  a  particle  moving  under  a  central 
inverse- cube- law  force,  described  by  the  potential  energy  function  V  = 
k/2r2.  For  the  repulsive  case  (k  >  0),  show  that  the  orbit  equation  is 
r  cos  n(9  —  9q)  =  b,  where  n,  b  and  #0  are  constants.  Show  that  for  the 
attractive  case  (k  <  0),  the  nature  of  the  orbit  depends  on  the  signs 
of  J2  +  mk  and  of  E.  Find  the  equation  of  the  orbit  for  each  possible 
type.  (Include  the  cases  where  one  of  these  parameters  vanishes.) 

26.  Show  that  the  scattering  angle  for  particles  of  mass  m  and  initial  ve- 
locity v  scattered  by  a  repulsive  inverse-cube- law  force  is  ir  —  ir/n  (see 
Problem  25).  Hence  find  the  differential  cross-section. 

27.  *The  potential  energy  of  a  particle  of  mass  m  is  V(r)  =  k/r  +  c/3r3, 
where  k  <  0  and  c  is  a  small  constant.  (The  gravitational  potential 
energy  in  the  equatorial  plane  of  the  Earth  has  approximately  this 
form,  because  of  its  flattened  shape  —  see  Chapter  6.)  Find  the  angular- 
velocity  w  in  a  circular  orbit  of  radius  a,  and  the  angular  frequency  u' 
of  small  radial  oscillations  about  this  circular  orbit.  Hence  show  that  a 
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nearly  circular  orbit  is  approximately  an  ellipse  whose  axes  precess  at 
an  angular  velocity  il  «  (c/\k\a2)u>. 

28.  A  beam  of  particles  strikes  a  wall  containing  2  x  1029  atoms  per  m3. 
Each  atom  behaves  like  a  sphere  of  radius  3  x  10-15  m.  Find  the  thick- 
ness of  wall  that  exactly  half  the  particles  will  penetrate  without  scat- 
tering. What  thickness  would  be  needed  to  stop  all  but  one  particle  in 
106? 

29.  An  a-particle  of  energy  4keV  (leV  =  1.6  x  10_19J)  is  scattered  by 
an  aluminium  atom  through  an  angle  of  90°.  Calculate  the  distance 
of  closest  approach  to  the  nucleus.  (Atomic  number  of  a-particle  =  2, 
atomic  number  of  Al  =  13,  e  =  1.6  x  10-19  C.)  A  beam  of  such  parti- 
cles with  a  flux  of  3  x  10s  m-2  s_1  strikes  a  target  containing  50  mg  of 
aluminium.  A  detector  of  cross-sectional  area  400  mm2  is  placed  0.6  m 
from  the  target  in  a  direction  at  right  angles  to  the  beam  direction. 
Find  the  rate  of  detection  of  a-particles.  (Atomic  mass  of  Al  =  27  u; 
lu  =  1.66  x  10-27kg.) 

30.  *It  was  shown  in  §3.4  that  Kepler's  second  law  of  planetary  motion 
implies  that  the  force  is  central.  Show  that  his  first  law  —  that  the 
orbit  is  an  ellipse  with  the  Sun  at  a  focus  —  implies  the  inverse  square 
law.  (Hint:  By  differentiating  the  orbit  equation  l/r  =  1  +  ecos#,  and 
using  (3.26),  find  r  and  f  in  terms  of  r  and  9.  Hence  calculate  the 
radial  acceleration.) 

31.  Show  that  Kepler's  third  law,  r  oc  a3/2,  implies  that  the  force  on  a 
planet  is  proportional  to  its  mass. 

[This  law  was  originally  expressed  by  Kepler  as  r  oc  f3/2,  where  f  is  a 
'mean  value'  of  r.  For  an  ellipse,  the  mean  over  angle  9  is  in  fact  b;  the 
mean  over  time  is  actually  a(l  +  |e2);  it  is  the  mean  over  arc  length 
—  or  the  median  —  which  is  given  by  al  Of  course,  for  most  planets 
in  our  Solar  System  these  values  are  almost  equal.] 
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Chapter  5 

Rotating  Frames 


Hitherto,  we  have  always  used  inertial  frames,  in  which  the  laws  of  motion 
take  on  the  simple  form  expressed  in  Newton's  laws.  There  are,  however,  a 
number  of  problems  that  can  most  easily  be  solved  by  using  a  non-inertial 
frame.  For  example,  when  discussing  the  motion  of  a  particle  near  the 
Earth's  surface,  it  is  often  convenient  to  use  a  frame  which  is  rigidly  fixed 
to  the  Earth,  and  rotates  with  it.  In  this  chapter,  we  shall  find  the  equations 
of  motion  with  respect  to  such  a  frame,  and  discuss  some  applications  of 
them. 


5.1    Angular  Velocity;  Rate  of  Change  of  a  Vector 

Let  us  consider  a  solid  body  which  is  rotating  with  constant  angular  velocity 
co  about  a  fixed  axis.  Let  n  be  a  unit  vector  along  the  axis,  whose  direction 
is  defined  by  the  right-hand  rule:  it  is  the  direction  in  which  a  right-hand- 
thread  screw  would  move  when  turned  in  the  direction  of  the  rotation. 
Then  we  define  the  vector  angular  velocity  w  to  be  a  vector  of  magnitude 
uj  in  the  direction  of  n:  uj  =  urn.  Clearly,  angular  velocity,  like  angular 
momentum,  is  an  axial  vector  (see  §3.3). 

For  example,  for  the  Earth,  uj  is  a  vector  pointing  along  the  polar  axis, 
towards  the  north  pole.  Its  magnitude  is  equal  to  2n  divided  by  the  length 
of  the  sidereal  day  (the  rotation  period  with  respect  to  the  fixed  stars, 
which  is  less  than  that  with  respect  to  the  Sun  by  one  part  in  365),  that  is 

uj  =  s"1  =  7.292  x  10-5s_1.  (5.1) 

86164  v  ' 

If  we  take  the  origin  to  lie  on  the  axis  of  rotation,  then  the  velocity  of 
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a  point  of  the  body  at  position  r  is  given  by  the  simple  formula 

»  =  uAr.  (5.2) 

To  prove  this,  we  note  that  the  point  moves  with  angular  velocity  w  around 
a  circle  of  radius  p  =  rs'mO  (see  Fig.  5.1).  Thus  its  speed  is 


CO 

e A 

Fig.  5.1 


v  =  uip  =  uir  smd  =  \u>  A  r\. 

Moreover,  the  direction  of  v  is  that  of  o>  A  r;  for  clearly,  v  is  perpendicular 
to  the  plane  containing  u>  and  r,  and  it  is  easy  to  see  from  the  figure  that 
its  sense  is  given  correctly  by  the  right-hand  rule.  Thus  (5.2)  is  correct 
both  as  regards  magnitude  and  direction. 

It  is  not  necessary  that  r  should  be  the  position  vector  of  a  point  of  the 
rotating  body.  If  a  is  any  vector  fixed  in  the  rotating  body,  then  by  the 
same  argument, 

^=u>Aa.  (5.3) 

In  particular,  if  i,  j,  k  are  orthogonal  unit  vectors  fixed  in  the  body,  then 

di  dj  dk  ,  . 

-  =  u,A*,        ^="Aj,        ^=«Afc.  (5.4) 

For  example,  if  w  is  in  the  k  direction,  then 

di        .         dj  .  dk 

(Compare  Fig.  5.2.) 
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Fig.  5.2 

Now  consider  a  vector  a  specified  with  respect  to  the  rotating  axes  i,  j ,  k 
by  the  components  ax,ay,az,  so  that 


azk. 


(Our  discussion  applies  quite  generally,  but  it  may  be  helpful  to  think  of 
a  as  the  position  vector  of  a  particle  with  respect  to  the  Earth.)  We  must 
distinguish  two  kinds  of  'rates  of  change',  and  it  will  be  convenient  to 
suspend  for  the  moment  the  convention  whereby  da/dt  and  a  mean  the 
same  thing.  We  shall  denote  by  da/dt  the  rate  of  change  of  a  as  measured 
by  an  inertial  observer  at  rest  relative  to  the  origin,  and  by  a  the  rate  of 
change  as  measured  by  an  observer  rotating  with  the  solid  body.  We  wish 
to  find  the  relation  between  these  two  rates  of  change. 

Now,  although  our  two  observers  differ  about  the  rate  of  change  of  a 
vector,  they  will  always  agree  about  the  rate  of  change  of  any  scalar  quan- 
tity, and  in  particular  about  the  rates  of  change  of  the  three  components 
ax,  ay,  az.  (Note  that  for  the  inertial  observer,  these  are  not  the  components 
of  a  with  respect  to  his  own  set  of  axes.  An  observer  outside  the  Earth  will 
always  agree  with  one  on  the  Earth  about  the  latitude  and  longitude  of  a 
small  body  moving  on  the  surface,  though  these  are  not  the  co-ordinates  he 
would  naturally  use  himself  to  describe  its  position.)  Hence  we  can  write 


dax 
dt 


<jgy 
dt 


daz 
dt 


a*. 


(5.5) 


According  to  the  observer  on  the  rotating  body,  the  rate  of  change  of  a 
is  fully  described  by  the  rates  of  change  of  its  three  components,  so  that 


d  =  axi  +  ciyj  +  dzk. 


(5.6) 
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However,  to  the  inertial  observer,  the  axes  i,j,k  are  themselves  changing 
with  time  according  to  (5.4),  so  that 

da      /  dax  .     dav  .     daz ,  \      (    di         dj         dk  \ 

*  =  ( + ^J  +  -drk) +  {a*dt + a«i + a^) 

=  (axi  +  ayj  +  azk)  +  uA  (axi  +  ayj  +  azk), 
by  (5.5)  and  (5.4).  Thus,  finally,  we  obtain 

d°      .  ,„  „N 

—  =  a  +  u>  A  a.  (5.7) 

Applied  to  the  position  vector  r,  this  result  is  almost  obvious,  for  it  states 
that  the  velocity  with  respect  to  the  inertial  observer  is  the  sum  of  the 
velocity  r  with  respect  to  the  rotating  frame  and  the  velocity  u>  A  r  of  a 
particle  at  r  rotating  with  the  body.  We  have  given  a  detailed  proof  because 
of  the  central  importance  of  this  result,  and  because  its  application  to  other 
vectors  is  much  less  obvious. 

We  shall  frequently  encounter  in  our  later  work  equations  similar  to 
(5.3):  da/dt  =  u>  A  a.  It  is  important  to  realize  that  one  can  reverse 
the  argument  that  led  up  to  it.  If  a  satisfies  this  equation,  then  it  must 
be  a  vector  of  constant  length  rotating  with  angular  velocity  ui  about  the 
direction  of  oj.  For,  if  we  introduce  a  frame  of  reference  rotating  with 
angular  velocity  w,  then  according  to  (5.7),  a  =  0,  so  that  a  is  fixed  in  the 
rotating  frame. 


5.2    Particle  in  a  Uniform  Magnetic  Field 

A  particle  of  charge  q  moving  with  velocity  v  in  a  magnetic  field  B(r) 
experiences  a  force  proportional  to  its  velocity,  and  perpendicular  to  it, 

F  =  qv  A  B.  (5.8) 

The  equation  of  motion  is  then 

dv 

m—  =  qv  A  B.  (5.9) 

Now  let  us  suppose  that  the  magnetic  field  is  uniform  (independent  of 
position)  and  constant  in  time.  Then  (5.9)  has  precisely  the  form  of  (5.3), 


—  =  U)  A  v,        where       ui  =  B. 

dt  m 


(5.10) 
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Hence,  by  the  argument  of  the  preceding  section,  the  velocity  vector  v 
rotates  about  the  direction  of  B  with  a  constant  angular  velocity.  It  follows 
that  the  path  of  the  particle  is  a  helix  whose  axis  points  along  the  field  lines 
(see  Fig.  5.3).  If  we  take  the  direction  of  B  to  be  the  z-direction,  then  v  has 


Fig.  5.3 


a  constant  z-component,  and  a  component  in  the  :ry-plane  which  rotates 
around  a  circle  with  angular  velocity  u>. 

In  particular,  if  v  is  initially  parallel  to  B,  the  particle  will  continue 
to  move  with  uniform  velocity  along  a  field  line.  The  other  limiting  case, 
in  which  v  is  perpendicular  to  B,  is  more  interesting.  In  that  case,  the 
particle  describes  a  circle  of  radius 

v  mv 

r=-  =  —.  5.11) 

u  qB 

This  effect  is  used  in  accelerating  particles  in  the  cyclotron.  In  order 
to  prevent  the  particles  from  escaping,  a  strong  magnetic  field  is  employed 
to  constrain  them  to  move  in  circles.  To  accelerate  them,  an  alternating 
voltage  is  applied  between  two  D-shaped  pole  pieces  enclosing  two  halves  of 
the  circles  (see  Fig.  5.4).  The  angular  frequency  of  this  alternating  voltage 
is  chosen  to  coincide  with  the  cyclotron  frequency 

ujc  =  — .  (5.12) 

m 

Tims,  by  the  time  the  particles  have  made  a  half-revolution,  the  direction 
of  the  electric  field  has  reversed,  and  they  experience  an  accelerating  field 
each  time  they  cross  the  gap.  Each  semicircle  is  therefore  slightly  larger 
than  the  preceding  one,  and  the  particles  spiral  outwards  with  increasing 
energy.  Finally,  one  obtains  a  beam  of  high-velocity  particles  at  the  outer 
edge  of  the  cyclotron. 

This  method  can  be  used  only  so  long  as  the  speed  remains  small  com- 
pared to  the  speed  of  light;  for,  according  to  relativity  theory,  the  angular- 
frequency  ujc  is  not  precisely  constant  when  the  speed  is  large,  but  decreases 
with  increasing  speed.  Quite  similar  methods  can  be  used,  however,  even 
at  the  highest  energies. 
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Fig.  5.4 

If  there  are  both  electric  and  magnetic  fields  present,  a  charged  particle 
experiences  the  Lorentz  force,  F  =  q(E  +  v  A  B).  An  interesting  case  is 
that  of  crossed  fields. 

Example:  Crossed  electric  and  magnetic  fields 

A  particle  of  mass  m  and  charge  q  moves  in  uniform  perpendic- 
ular electric  and  magnetic  fields,  E  =  Ej  and  B  =  Bk.  Show 
that  if  it  is  initially  moving  in  the  xy-plane,  it  will  continue  to  do 
so,  and  that  it  has  a  circular  motion  with  a  superimposed  uni- 
form drift  velocity  in  a  direction  perpendicular  to  both  E  and 
B.  Find  the  solution  for  the  case  when  the  particle  starts  from 
rest  at  the  origin. 

The  equation  of  motion  mr  =  qE+qrAB  is,  in  terms  of  components, 

mx  =  qBy,        my  =  qE  —  qBx,        mz  =  0. 

Clearly,  if  z  =  z  =  0  initially,  this  will  be  true  always.  We  can  eliminate 
the  term  in  E  by  going  over  to  a  frame  moving  with  relative  velocity 
(E/B)i  (see  Chapter  1,  Problem  5),  in  which  the  co-ordinates  are  given 
by 


x'  =  x-(E/B)t,      y'  =  y,      z'  =  z. 
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Then  x!  =  x  —  E  /  B,y'  =  y,z'  =  z.  Thus  the  equations  of  motion  become 

mi'  =  qBy' ,       my'  =  —qBx',       mz'  =  0. 

So  in  the  moving  frame,  the  particle  moves  in  a  circle,  as  in  the  discussion 
above.  In  the  original  frame,  this  means  circular  motion  with  a  uniform 
velocity  superimposed.  The  initial  conditions  yield  x'  =  —E/B,  so  the 
radius  of  the  circular  motion  is  E/Bujc.  If  the  particle  starts  from  the 
origin,  the  complete  solution  is 

E  (      smu)rt\  E 

y=— — (l-coswct),        z  =  0. 


B  V         ojc    J  Blo. 


5.3    Acceleration;  Apparent  Gravity 

We  can  use  the  formula  (5.7)  twice  to  obtain  the  relation  between  the 
absolute  acceleration  d2r/dt2  of  a  particle  and  its  acceleration  r  relative  to 
a  rotating  frame. 

The  velocity  of  the  particle  relative  to  an  inertial  frame  is 

v  =  —  =  r  +  u)  Ar.  (5.13) 

Applying  the  same  formula  to  the  rate  of  change  of  v,  we  have 

d2^      dv      .  . 
_  =  _  =  v  +  wAv.  (5.14) 

But,  from  (5.13),  assuming  that  u>  is  constant,  we  find 

v  =  r  +  u>  A  r, 

and 

u)A))=uAf  IwA(uAr). 
Hence,  substituting  in  (5.14),  we  obtain 
d2r 

— -  =  r  +  2w  A  r  +  u>  A  (w  A  r).  (5.15) 
At1 

The  second  term  on  the  right  is  called  the  Coriolis  acceleration,  and  the 
third  term  is  the  centripetal  acceleration.  The  latter  is  directed  inwards 
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towards  the  axis  of  rotation,  and  perpendicular  to  it,  as  may  be  seen  by 
writing  it  in  the  form  (see  (A. 16)) 

u>  A  (w  A  r)  =  (u>  ■  r)ui  —  oj2r. 

The  most  important  application  of  (5.15)  is  to  a  particle  moving  near 
the  surface  of  the  Earth.  For  a  particle  moving  under  gravity,  and  under 
an  additional,  mechanical  force  F,  the  equation  of  motion  is 

d2r 

m-^j  =mg  +  F, 

where  g  is  a  vector  of  magnitude  g  pointing  downward  (towards  the  centre 
of  the  Earth).  Using  (5.15),  and  moving  all  the  terms  except  the  relative 
acceleration  to  the  right  side  of  the  equation,  we  can  write  it  as 

mr  =  mg  +  F  —  2mw  A  r  —  mu  A  (u>  A  r).  (5.16) 

The  last  two  terms  are  apparent  (or  fictitious)  forces,  which  arise  because 
of  the  non-inertial  nature  of  the  frame  of  reference.  We  shall  postpone 
discussion  of  the  effects  of  the  third  term  —  the  Coriolis  force  —  to  the  next 
section,  and  consider  here  the  last  term,  which  is  the  familiar  centrifugal 
force. 

This  force  is  a  slowly  varying  function  of  position,  proportional,  like 
gravity,  to  the  mass  of  the  particle.  When  we  make  a  laboratory  measure- 
ment of  the  acceleration  due  to  gravity,  what  we  actually  measure  is  not  g 
but 

g*  =g-wA(wAr).  (5.17) 

In  particular,  a  plumb  line  does  not  point  directly  towards  the  centre  of 
the  Earth,  but  is  swung  outwards  through  a  small  angle  by  the  centrifugal 
force.  (See  Fig.  5.5,  in  which  this  effect  is  greatly  exaggerated.)  Let  us 
consider  a  point  in  colatitude  (tt/2  —  latitude)  9.  Then 

|uA(wAr)|  =  u)\oj  A  r\  =  u)2rsm9. 

Thus  the  horizontal  and  vertical  components  of  the  effective  gravitational 
acceleration  g*  (using  'vertical'  to  mean  towards  the  Earth's  centre)  are 

g^  =  uj2r  sin  9  cos  9,       g*  =  g  —  ui2r  sin2  9. 

(There  are  further  corrections,  to  be  discussed  later,  due  to  the  non- 
spherical  shape  of  the  Earth.) 
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Fig.  5.5 

The  magnitude  of  the  centrifugal  force  may  be  found  by  inserting  the 
value  (5.1)  for  ui,  and  for  r  the  mean  radius  of  the  Earth,  6371  km.  We  find 

cA-  =  34mms-2.  (5.18) 

Since  ui2r  -C  9,  the  angle  a  between  the  apparent  and  true  verticals  is 
approximately 

9h  uj2'r 
a  «  —  «   sin  9  cos  8. 

9*  9 

The  maximum  value  occurs  at  6  =  45°,  and  is  about  0°6'. 

At  the  pole,  there  is  no  centrifugal  force,  and  g*  =  g.  On  the  equator, 
g*  =  g  —  uj2r.  Thus  we  might  expect  the  measured  value  of  the  acceleration 
due  to  gravity  to  be  larger  at  the  pole  by  34mms-2.  The  actual  measured 
difference  is  somewhat  larger  than  this, 

A9*  =  9po\e  ~  Sequator  =  52mmS-2.  (5.19) 

This  discrepancy  arises  from  the  fact  that  the  Earth  is  not  a  perfect  sphere, 
but  more  nearly  spheroidal  in  shape,  flattened  at  the  poles.  Thus  the 
gravitational  acceleration  g,  even  excluding  the  centrifugal  term,  is  itself 
larger  at  the  pole  than  on  the  equator.  These  two  effects  are  not  really 
independent,  for  the  flattening  of  the  Earth  is  a  consequence  of  its  rotation. 
We  shall  discuss  this  point  in  the  next  chapter. 

As  another  example  of  the  effect  of  the  centrifugal  force,  we  shall  con- 
sider the  problem  of  a  liquid  in  a  rotating  vessel. 
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Example:  Surface  of  a  rotating  liquid 

Liquid  in  a  vessel  rotating  with  angular  velocity  cu  is  rotating 
with  the  same  angular  velocity.  Find  the  equation  for  its  surface. 

We  use  a  frame  rotating  with  the  vessel,  so  that  in  the  rotating  frame 
the  liquid  is  at  rest,  but  subject  to  the  centrifugal  force.  Now  this  force 
is  conservative,  and  corresponds  to  a  potential  energy 

Kent  =  -\muj2p2  =  -\mu2(x2  +  y2).  (5.20) 

Consider  a  particle  on  the  surface  of  the  liquid.  Unless  the  surface  is 
one  on  which  potential  energy  is  constant,  the  particle  will  tend  to  move 
towards  regions  of  lower  potential  energy.  Hence  if  the  liquid  is  in  equi- 
librium under  the  gravitational  and  centrifugal  forces,  its  surface  must 
be  an  equipotential  surface, 

gz  —  \bj2(x2  +  y2)  =  constant. 

This  is  a  paraboloid  of  revolution  about  the  z-axis. 

This  effect  is  used  in  constructing  some  high-powered  optical  telescopes. 
Because  a  paraboloid  is  the  ideal  shape  to  focus  light  from  directly  above  it, 
the  mirror  is  made  by  spinning  a  thin  film  of  liquid,  for  example  mercury. 
This  is  very  much  cheaper  than  grinding  glass  mirrors  to  a  precise  shape  — 
though  it  has  the  disadvantage  that  the  telescope  can  only  look  at  objects 
near  the  zenith. 

5.4    Coriolis  Force 

The  Coriolis  force,  —2mu>  A  f,  is  an  apparent,  velocity-dependent  force 
arising  from  the  Earth's  rotation.  To  understand  its  physical  origin,  it  may 
be  helpful  to  consider  a  flat,  rotating  disc.  Suppose  that  a  particle  moves 
across  the  disc  under  no  forces,  so  that  an  inertial  observer  sees  it  moving 
diametrically  across  in  a  straight  line.  (See  Fig.  5.6(a).)  Then,  because 
the  disc  is  rotating,  an  observer  on  the  disc  will  see  the  particle  crossing 
successive  radii,  following  a  curved  track  as  in  Fig.  5.6(b).  If  he  is  unaware 
that  the  disc  is  rotating,  he  will  ascribe  this  curvature  to  a  force  acting  on 
the  particle  at  right  angles  to  its  velocity.  This  is  the  Coriolis  force. 

We  may  regard  Fig.  5.6  as  representing  the  Earth  viewed  from  the  north 
pole.  Thus  we  see  that  the  effect  of  the  Coriolis  force  is  to  make  a  particle 
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Fig.  5.6 

deviate  to  the  right  in  the  northern  hemisphere,  and  to  the  left  in  the 
southern.  (It  also  has  a  vertical  component,  in  general.)  It  is  responsible  for 
some  well-known  meteorological  phenomena,  which  we  shall  discuss  below. 

First,  however,  we  examine  some  effects  of  the  Coriolis  force  which  are 
observable  in  the  laboratory.  Let  us  consider  a  particle  moving  near  the 
surface  of  the  Earth  in  colatitude  9.  We  shall  suppose  that  the  distance 
through  which  it  moves  is  sufficiently  small  for  both  the  gravitational  and 
centrifugal  forces  to  be  effectively  constant.  Then  we  may  combine  them 
in  a  constant  effective  gravitational  acceleration  g* .  For  convenience,  we 
shall  drop  the  star,  and  write  this  simply  as  g.  The  equation  of  motion  is 
then 

mir  =  rag  +  F  -  2mw  A  r.  (5-21) 

As  an  example,  let  us  take  a  small  body  falling  freely  under  gravity 
towards  the  Earth's  surface. 

Example:  Freely  falling  body 

A  body  is  dropped  from  rest  at  a  height  h  above  the  ground. 
How  far  from  the  point  directly  below  its  starting  point  does  it 
land? 

To  analyze  this  problem,  it  is  convenient  to  choose  our  axes  so  that 
i  is  east,  j  is  north,  and  k  up.  (See  Fig.  5.7.)  Here  'up'  means  opposite 
to  g* ,  and  the  colatitude  9  is  the  angle  between  that  direction  and  the 
Earth's  axis.  The  angular  velocity  vector  then  has  the  components 

u>  =  (0, 10  sin  9,  uj  cos  9) . 
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(1) 


Fig.  5.7 


Hence  the  Coriolis  force  is 

—2mu>  A  r  =  2mw(ji cos#  —  i sin  6*,  — icos 9,xsm6).  (5.22) 

Neglecting  the  Coriolis  force,  the  motion  is  described  by 

x  =  0,        y  =  0,        z  =  h—^gt2. 

We  shall  calculate  the  effect  of  the  Coriolis  force  only  to  first  order;  that 
is,  we  neglect  terms  of  order  to2 .  Since  the  Coriolis  force  (5.22)  contains 
an  explicit  factor  of  u>,  this  means  that  we  may  substitute  for  r  the 
zeroth-order  value  r  =  (0,0,  —  gt).  The  equations  of  motion  (5.21)  then 
read 

mx  =  2mu>gt  sin  6*,       mij  =  0,       mz  =  —rag. 

The  solution  with  appropriate  initial  conditions  is,  therefore, 

x  =  \u>gt3  sin  6*,        y  =  0,        z  =  h—\gt1. 

Thus  the  body  will  hit  the  ground,  z  =  0,  at  a  point  east  of  that  vertically 
below  its  point  of  release,  at  a  distance 

^=|^y/2sin0.  (5.23) 

For  example,  if  a  particle  is  dropped  from  a  height  of  100  m  in  latitude 
45°,  the  deviation  is  about  16mm. 

It  is  instructive  to  consider  how  an  inertial  observer  would  describe  this 
experiment.  Since  the  particle  is  dropped  from  rest  relative  to  the  Earth, 
it  has  a  component  of  velocity  towards  the  east  relative  to  the  inertial 
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(a)  (b) 

Fig.  5.8 

observer.  As  it  falls,  the  angular  momentum  about  the  Earth's  axis  remains 
constant,  and  therefore  its  angular  velocity  increases,  so  that  it  gets  ahead 
of  the  ground  beneath  it.  Figs.  5.8(a)  and  5.8(b)  show  the  experiment  as 
it  appears  (but  much  exaggerated)  to  an  observer  on  the  Earth  and  to  an 
inertial  observer,  respectively. 

Foucault's  pendulum 

Another  way  of  observing  the  effect  of  the  Coriolis  force  is  to  use  Foucault's 
pendulum.  This  is  simply  an  ordinary  pendulum  free  to  swing  in  any  direc- 
tion, and  carefully  arranged  to  be  perfectly  symmetric,  so  that  its  periods 
of  oscillation  in  all  directions  are  precisely  equal.  (It  must  also  be  long,  and 
fairly  heavy,  so  that  it  will  go  on  swinging  freely  for  several  hours  at  least, 
despite  the  resistance  of  the  air.)  Examples  of  the  Foucault  pendulum  are 
to  be  found  in  many  science  museums. 

If  the  amplitude  is  small,  the  pendulum  equation  is  simply  the  equation 
of  two-dimensional  simple  harmonic  motion.  The  vertical  component  of 
the  Coriolis  force  is  negligible,  for  it  is  merely  a  small  correction  to  g, 
whose  sign  alternates  on  each  half-period.  (Even  for  a  velocity  x  as  large 
as  10ms"1,  we  have  2uix  =  1.5mms-2  -C  g.)  The  important  components 
are  the  horizontal  ones.  For  small  amplitude,  the  velocity  of  the  pendulum 
bob  is  almost  horizontal,  so  that  z  «  0.  Thus  the  equations  of  motion  for 
the  x  and  y  co-ordinates  are 

x  =  —  jx  +  2utycos9,        y  =  —  jy  —  2uix  cos  6,  (5-24) 


or,  in  vector  notation, 

r  = 


2lj  cosOk  A  r. 


(5.25) 
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Let  us  first  suppose  that  the  pendulum  is  at  the  north  pole  (6  =  0). 
Then  it  is  clear  from  the  equation  of  motion  in  a  non-rotating  frame  that 
it  must  swing  in  a  fixed  direction  in  space,  while  the  Earth  rotates  beneath 
it.  Thus,  relative  to  the  Earth,  its  oscillation  plane  must  rotate  around 
the  vertical  with  angular  velocity  —  lo.  Now,  at  any  other  latitude,  the 
only  difference  in  Eq.  (5.25)  is  that  in  place  of  the  angular  velocity  w 
we  have  only  its  vertical  component,  locos  9  k.  Hence,  we  should  expect 
that  the  oscillation  plane  of  the  pendulum  rotates  with  angular  velocity 
— fl  =  —  u;cos#  around  the  vertical.  In  effect,  we  may  regard  the  Earth's 
surface  in  colatitude  9  as  rotating  about  the  vertical  with  angular  velocity 
ui  cos 9  (and  also  with  angular  velocity  uism9  about  a  horizontal  north- 
south  axis,  but  this  component  leads  only  to  a  vertical  Coriolis  force,  which 
does  not  appreciably  affect  the  motion.) 

We  can  verify  this  conclusion  by  obtaining  an  explicit  solution  of 
Eqs.  (5.24).  A  neat  way  of  doing  this  is  to  combine  the  two  equations 
by  introducing  a  complex  variable  z  =  x  +  iy  (not  of  course  to  be  confused 
with  the  vertical  co-ordinate).  Then  y  —  ix  =  —iz,  so  that,  adding  i  times 
the  second  equation  to  the  first,  we  get 


where  fl  =  locos  9  and  lOq  =  g/l.  We  now  look  for  solutions  of  the  form 
2  =  Aept  and,  as  in  §2.5,  find  for  p  the  equation 


The  roots  of  this  equation  are  p  =  —ifl  ±  iwi,  where  lo\  =  lOq  +  fl2.  Hence, 
the  general  solution  of  the  equation  for  z  is 


In  particular,  if  the  pendulum  is  released  from  rest  at  position  (a,  0)  at 
t  =  0,  we  must  choose  A  =  B  =  a/2,  so  that  the  solution  is 


z  +  2if2i  +  lo\z  =  0, 


p2  +  2iflp  +  ul  =  0. 


z  = 


ae 


-ifit 


cos  Wit, 


or,  in  terms  of  x  and  y, 


x  =  a  cos  fit  cos lo it, 


y  =  —  a  sin  fit  cos  lo \t. 
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So  at  time  t,  since  U  -C  0J0  and  wo  —  wi,  the  pendulum  bob  oscillates 
between  the  extreme  points  ±a(cosfif,sinfi£).  The  solution  represents  an 
oscillation  with  amplitude  a  in  a  plane  rotating  with  angular  velocity  —  fi. 

It  should  be  noted  that  in  deriving  (5.24)  we  neglected  terms  of  order 
fi2  (in  particular,  the  variation  in  the  centrifugal  force  is  of  this  order). 
Tims  the  difference  between  the  angular  frequency  uj\  in  our  solution  and 
loq  is  not  significant.  The  period  of  the  pendulum  will  not  be  substantially 
affected. 

At  the  pole,  the  plane  of  oscillation  makes  a  complete  revolution  in  just 
24  hours.  At  any  other  latitude,  the  period  is  greater  than  this,  and  is  in 
fact  2ir I 'u>  cos  6 .  In  latitude  45°,  it  is  about  34  hours,  while  on  the  equator 
(where  u>  is  purely  horizontal),  it  is  infinite. 

Cyclones  and  trade  winds 

There  are  also  some  important  large-scale  effects  of  the  Coriolis  force.  Sup- 
pose that  for  some  reason  a  region  of  low  pressure  develops  in  the  northern 
hemisphere.  The  air  around  it  will  be  pushed  inwards  by  the  force  of  the 
pressure  gradient.  As  it  starts  to  move,  however,  the  Coriolis  force  cause  it 
to  curve  to  the  right.  Thus  an  anticlockwise  rotation  is  set  up  around  the 
low-pressure  zone.  The  process  will  continue  until  an  approximate  equi- 
librium is  established  between  the  pressure  force,  acting  inwards,  and  the 
Coriolis  force  (plus  the  centrifugal  force  of  the  rotation)  acting  outwards. 
This  configuration  is  a  cyclone,  or  depression,  familiar  to  those  who  live 
in  temperate  latitudes.  (As  in  the  case  of  Foucault's  pendulum,  there  is 
no  effect  of  this  kind  on  the  equator.)  More  generally,  in  these  latitudes, 
the  wind  velocity  is  not  directly  from  regions  of  high  to  low  pressure,  but 
more  nearly  along  the  isobars,  keeping  the  low  pressure  on  the  left  in  the 
northern  hemisphere. 

The  same  effect  is  responsible,  on  a  yet  larger  scale,  for  the  trade  winds. 
The  heating  of  the  Earth's  surface  near  the  equator  causes  the  air  to  rise, 
and  be  replaced  by  cooler  air  flowing  in  from  higher  latitudes.  However, 
because  of  the  Coriolis  force,  it  does  not  flow  directly  north  or  south,  but 
is  made  to  deviate  towards  the  west.  Thus  we  have  the  north-east  trade 
winds  in  the  northern  hemisphere,  and  the  south-east  trade  winds  in  the 
southern. 

For  reasons  that  are  too  complex  to  discuss  in  detail  here,  this  pattern 
does  not  extend  to  high  latitudes.  Further  from  the  equator  there  are  belts 
of  high  pressure,  which,  like  the  equatorial  region  itself,  are  characterized 
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by  light  and  variable  winds.  Beyond  these,  the  direction  of  the  pressure 
gradient  is  reversed.  Thus  around  40°  to  50°  N  or  S  the  prevailing  wind 
direction  is  westerly  rather  than  easterly.  Near  the  poles,  there  is  a  further 
reversal,  and  the  circumpolar  winds  blow  from  the  east. 


5.5    Larmor  Effect 

As  a  rather  different  example  of  the  use  of  rotating  frames,  we  shall  consider 
the  effect  of  a  magnetic  field  on  a  particle  of  charge  q  moving  in  an  orbit 
around  a  fixed  point  charge  —q'.  The  equation  of  motion,  including  the 
magnetic  force  (5.8),  is 

d2r         k  dr 

where  k  =  qq' /Aweo.  Let  us  rewrite  this  equation  in  terms  of  a  rotating 
frame  of  reference,  using  (5.7)  and  (5.15).  We  obtain 

H2uAf  +  ii)A(uAr)  =  ^—  r  +  —  (r  +  u>  A  r)  A  B. 

mr1  m 

Now,  if  we  choose  u>  =  —(q/2m)B,  then  the  terms  in  r  drop  out.  The  last 
term  on  the  left  only  cancels  half  the  last  term  on  the  right,  however,  so  we 
obtain 

r  =  k-^r  +  f  tt-)2  -B  A  (B  A  r). 

mr1  \2mJ 

We  now  assume  that  the  magnetic  field  is  sufficiently  weak  for  the  quadratic 
term  in  B  to  be  negligible  in  comparison  to  the  electrostatic  force  term. 
The  necessary  condition  for  this  is  that 

2      ^S\2        k            qq'  2 
w   =    —      <   rr  =  ~  «  Wq,  (5.26) 

where  we  have  used  (4.31)  to  express  the  right  side  in  terms  of  the  mean 
angular  velocity  ui§  of  the  particle  in  its  orbit. 

When  this  condition  is  satisfied,  we  obtain  the  approximate  equation 

k  . 


But  this  is  just  the  usual  equation  of  motion  for  a  particle  in  an  inverse- 
square-law  field.  Consequently,  a  bounded  orbit  in  the  rotating  frame  is  an 
ellipse.  In  the  original,  non-rotating  frame,  it  is  a  slowly  precessing  ellipse, 
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precessing  with  angular  velocity  u>.  (See  Fig.  5.9,  which  is  drawn  for  the 
special  case  in  which  the  orbit  lies  in  the  plane  normal  to  B.  In  general,  it 


Fig.  5.9 

will  be  inclined,  and  the  plane  of  the  orbit  will  precess  about  the  direction 
of  B  )  Note  that,  in  view  of  the  restriction  (5.26),  the  axes  of  the  ellipse 
will  precess  only  through  a  small  angle  on  each  revolution. 

This  effect  is  known  as  the  Larmor  effect,  and  the  angular  velocity  of 
precession, 

qB 

is  called  the  Larmor  frequency.  Note  that  it  is  just  half  the  cyclotron 
frequency  (5.12).  The  difference  arises  from  the  factor  of  2  in  the  Coriolis 
acceleration  term  of  (5.15). 

This  effect  leads  to  observable  changes  in  the  spectra  emitted  by  atoms 
in  the  presence  of  a  magnetic  field,  since  the  Bohr  energy  level  correspond- 
ing to  a  given  angular  momentum  is  slightly  shifted  in  the  presence  of  a 
magnetic  field.  (The  shift  in  spectral  lines  is  known  as  the  Zeeman  effect.) 

When  the  weak  magnetic  field  approximation  is  not  applicable  then  the 
resulting  motion  can  be  very  complex  —  see  §14.4. 

5.6    Angular  Momentum  and  the  Larmor  Effect 

The  Larmor  effect  is  a  particular  example  of  a  more  general,  and  very 
important,  phenomenon.  As  we  shall  see  later  in  a  number  of  different 
examples,  the  effect  of  a  small  force  on  a  rotating  system  is  often  to  make 
the  axis  of  rotation  precess  —  that  is,  revolve  about  some  fixed  direction. 
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It  will  therefore  be  useful  to  consider  here  an  alternative  treatment  of  the 
problem  which  is  of  more  general  applicability. 

If  the  magnetic  field  is  weak,  then  to  a  first  approximation  the  parti- 
cle must  move  in  an  inverse-square-law  orbit  whose  characteristics  change 
slowly  with  time.  Since  the  magnetic  force  is  perpendicular  to  the  velocity, 
it  does  no  work.  Hence  the  energy  is  constant,  and  therefore  so  is  the  semi- 
major  axis  of  the  orbit.  However  the  angular  momentum  J  does  change 
with  time,  according  to  the  equation 

<^-=rAF  =  qrA(vAB)=  q[(r  ■  B)v  -  (r  ■  v)B],  (5.27) 

using  (A.  16). 

To  be  specific,  let  us  suppose  that  the  magnetic  field  is  in  the  z-direction, 
B  =  Bk,  and  that  the  particle  is  moving  in  a  circular  orbit  of  radius  r  in 
a  plane  inclined  to  the  xy-plane  at  an  angle  a.  (See  Fig.  5.10.)  It  will 
be  convenient  to  introduce  three  orthogonal  unit  vectors,  n  normal  to  the 


Fig.  5.10 

plane  of  the  orbit,  a  in  the  direction  of  k  A  n  and  b  =  n  A  a.  Then  the 
angular  momentum  is 

J  =  mr  A  v  =  mrvn.  (5.28) 

If  we  specify  the  position  of  the  particle  in  its  orbit  by  the  angle  ip  between 
a  and  r,  then  in  terms  of  the  axes  a,  b,  n,  the  components  of  r,  v  and  B 
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are 


r  =  (rcostp,rsmip,0), 
v  =  (— v  sin  ijj,v  cos  tp,0), 
B  =  (0,  B  sin  a,  B  cos  a). 


(There  is  also  a  small  contribution  to  v  from  the  fact  that  the  plane  of  the 
orbit  is  changing,  but  for  small  magnetic  fields  this  will  be  negligible.) 


Now,  in  a  circular  orbit,  r  ■  v  is  always  zero.  Hence  the  right  hand  side 
of  (5.27)  is 


Since  the  magnetic  field  is  weak,  the  change  in  J  in  a  single  orbit  will  be 
small.  Thus  the  oscillatory  term  in  (5.29)  is  unimportant:  it  leads  only 
to  a  very  small  periodic  oscillation  in  J.  The  important  term  is  the  one 
which  leads  to  a  secular  variation  of  J  (a  steady  change  in  one  direction). 
It  follows  that  we  can  replace  (5.29)  by  its  average  value  over  a  complete 
oscillation  period,  i.e.,  over  all  values  of  %j>  from  0  to  2ir.  Thus  we  replace 
sin2-0  =  |(1  —  cos2'</j)  by  |,  and  sin  tp  cos  if)  by  0.  This  yields  the  equation 


Thus  the  vector  J  will  move  in  the  direction  of  —a.  Since  dJ /At  is  perpen- 
dicular to  J,  its  magnitude  remains  unchanged,  while  its  direction  precesses 
around  the  direction  of  B.  In  fact,  (5.30)  can  be  seen  to  be  another  example 
of  an  equation  of  the  form  (5.3).  From  (5.28),  we  have 


qBr  sin  a  sin  tp  v  =  qBrv  sina(—  sin2  tp,  sin  tp  costp,  0). 


(5.29) 


\qBrv  sin  a  a. 


(5.30) 


k  A  J  =  mrvk  A  n  =  mrv  sin  a  a. 


Tims  (5.30)  may  be  written 


(5.31) 


This  is  the  equation  of  a  vector  J  rotating  with  angular  velocity 


qB 

LJ  =  —  . 

2m 
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Since  the  direction  of  J  is  that  of  the  normal  to  the  orbital  plane,  this 
agrees  with  our  previous  conclusion  that  the  orbital  plane  precesses  around 
the  direction  of  the  magnetic  field,  with  a  precessional  angular  velocity 
equal  to  the  angular  Larmor  frequency. 

Comparison  with  a  current  loop 

It  is  instructive  to  contrast  this  behaviour  of  a  charged  particle  with  the 
superficially  similar  case  of  a  current  loop.  A  circular  current  loop  in  the 
same  position  as  the  particle  orbit  would  experience  a  similar  force,  produc- 
ing a  moment  about  the  — o-axis.  The  effect  of  the  force,  however,  would 
be  quite  different.  Since  the  loop  itself  (that  is,  its  mass)  is  not  rotating, 
the  effect  would  be  simply  to  rotate  it  about  this  axis,  so  that  its  normal 
tends  to  become  aligned  with  the  direction  of  B.  (This  is  the  familiar  fact 
that  a  current  loop  behaves  like  a  magnetic  dipole.) 

To  understand  why  a  particle  moving  in  an  orbit  behaves  so  differently, 
it  may  be  helpful  to  consider,  in  place  of  a  continuously  acting  force,  a 
small  impulsive  blow  delivered  once  every  orbit,  say  at  the  farthest  point 
above  the  ary-plane.  The  effect  of  such  a  blow  is  to  give  the  particle  a 
small  component  of  velocity  perpendicular  to  its  original  orbit.  The  next 
orbit  will  be  in  a  plane  slightly  tilted  with  respect  to  the  original  one,  but 
reaching  just  as  far  from  the  ary-plane.  The  net  effect  is  to  make  the  orbit 
swing  around  the  2-axis.  The  essential  point  here  is  that  it  is  the  velocity 
of  the  particle,  rather  than  its  position,  that  is  changed  instantaneously. 
Tims  the  effect  of  a  small  blow  is  not  seen  in  a  shift  of  the  orbit  at  the 
point  of  the  blow,  but  rather  at  a  point  90°  later. 

We  shall  see  in  Chapter  9  that  a  rigid  body  displays  a  very  similar  type 
of  behaviour.  If  one  applies  a  small  force  downwards  on  the  rim  of  a  rapidly 
rotating  wheel,  it  is  not  that  point  that  moves  down,  but  a  point  90°  later. 
(This  is  easy  to  verify  with  a  freely  spinning  bicycle  wheel.) 

5.7  Summary 

In  problems  involving  a  rotating  body  —  particularly  the  Earth  —  it  is  often 
convenient  to  use  a  rotating  frame  of  reference.  The  equations  of  motion  in 
such  a  frame  contain  additional  terms  representing  apparent  forces  which 
arise  because  the  frame  is  non-inertial.  These  are  the  centrifugal  force, 
directed  outwards  from  the  axis  of  rotation,  and  the  velocity-dependent 
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Coriolis  force,  -2mwAr.  Because  the  magnetic  force  on  a  charged  particle 
tends  to  produce  rotation  about  the  direction  of  the  magnetic  field,  rotating 
frames  are  also  useful  in  many  problems  involving  a  magnetic  field. 


Problems 

1.  Find  the  centrifugal  acceleration  at  the  equator  of  the  planet  Jupiter 
and  of  the  Sun.  In  each  case,  express  your  answer  also  as  a  fraction  of 
the  surface  gravity.  (The  rotation  periods  are  10  hours  and  27  days, 
respectively,  the  radii  7.1  x  104km  and  7.0  x  105km,  and  the  masses 
1.9  x  1027kg  and  2.0  x  1030kg.) 

2.  Water  in  a  rotating  container  of  radius  50  mm  is  30  mm  lower  in  the 
centre  than  at  the  edge.  Find  the  angular  velocity  of  the  container. 

3.  The  water  in  a  circular  lake  of  radius  1  km  in  latitude  60°  is  at  rest 
relative  to  the  Earth.  Find  the  depth  by  which  the  centre  is  depressed 
relative  to  the  shore  by  the  centrifugal  force.  For  comparison,  find  the 
height  by  which  the  centre  is  raised  by  the  curvature  of  the  Earth's 
surface.  (Earth  radius  =  6400  km.) 

4.  Find  the  velocity  relative  to  an  inertial  frame  (in  which  the  centre  of  the 
Earth  is  at  rest)  of  a  point  on  the  Earth's  equator.  An  aircraft  is  flying 
above  the  equator  at  lOOOkmh-1.  Assuming  that  it  flies  straight  and 
level  (i.e.,  at  a  constant  altitude  above  the  surface)  what  is  its  velocity 
relative  to  the  inertial  frame  (a)  if  it  flies  north,  (b)  if  it  flies  west,  and 
(c)  if  it  flies  east? 

5.  The  apparent  weight  of  the  aircraft  in  Problem  4  when  on  the  ground 
at  the  equator  is  100 1  weight.  What  is  its  apparent  weight  in  each  of 
the  three  cases  (a)-(c)? 

6.  A  bird  of  mass  2  kg  is  flying  at  10ms-1  in  latitude  60°N,  heading  due 
east.  Find  the  horizontal  and  vertical  components  of  the  Coriolis  force 
acting  on  it. 

7.  The  wind  speed  in  colatitude  6  is  v.  By  considering  the  forces  on  a 
small  volume  of  air,  show  that  the  pressure  gradient  required  to  balance 
the  horizontal  component  of  the  Coriolis  force,  and  thus  to  main- 
tain a  constant  wind  direction,  is  dp/dx  =  lujpv  cos  0 ,  where  p  is  the 
density  of  the  air.  Evaluate  this  gradient  in  mbarkm-1  for  a  wind 
speed  of  50kmh_1  in  latitude  30°N.  (lbar  =  105  Pa;  density  of  air  = 
1.3  kgm"3.) 
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8.  An  aircraft  is  flying  at  800  km  h-1  in  latitude  55°N.  Find  the  angle 
through  which  it  must  tilt  its  wings  to  compensate  for  the  horizontal 
component  of  the  Coriolis  force. 

9.  An  orbiting  space  station  may  be  made  to  rotate  to  provide  an  artificial 
gravity.  Given  that  the  radius  is  25  m,  find  the  rotation  period  required 
to  produce  an  apparent  gravity  equal  to  0.7g.  A  man  whose  normal 
weight  is  75  kg  weight  runs  around  the  station  in  one  direction  and 
then  the  other  (i.e.,  on  a  circle  on  the  inside  of  the  cylindrical  wall) 
at  5ms"1.  Find  his  apparent  weight  in  each  case.  What  effects  will 
he  experience  if  he  climbs  a  ladder  to  a  higher  level  (i.e.,  closer  to  the 
axis),  climbing  at  1ms-1? 

10.  A  beam  of  particles  of  charge  q  and  velocity  v  is  emitted  from  a  point 
source,  roughly  parallel  with  a  magnetic  field  B,  but  with  a  small 
angular  dispersion.  Show  that  the  effect  of  the  field  is  to  focus  the  beam 
to  a  point  at  a  distance  z  =  2ivmv/\q\B  from  the  source.  Calculate  the 
focal  distance  for  electrons  of  kinetic  energy  500  eV  in  a  magnetic  field 
of  0.01  T.  (Charge  on  electron  =  -1.6  x  10"19  C,  mass  =  9.1  x  10"31kg, 
leV  =  1.6  x  10"19  J.) 

11.  *  Write  down  the  equation  of  motion  for  a  charged  particle  in  uniform, 
parallel  electric  and  magnetic  fields,  both  in  the  z-direction,  and  solve 
it,  given  that  the  particle  starts  from  the  origin  with  velocity  (v,0,0). 
A  screen  is  placed  at  x  =  a,  where  a  <C  mv/qB.  Show  that  the  locus 
of  points  of  arrival  of  particles  with  given  m  and  q,  but  different  speeds 
v,  is  approximately  a  parabola.  How  does  this  locus  depend  on  m  and 

12.  A  beam  of  particles  with  velocity  (v,  0,  0)  enters  a  region  containing 
crossed  electric  and  magnetic  fields,  as  in  the  example  at  the  end  of 
§5.2.  Show  that  if  the  ratio  E/B  is  correctly  chosen  the  particles  are 
undeviated,  while  particles  with  other  speeds  follow  curved  trajectories. 
Suppose  the  particles  have  velocities  equal  to  v  in  magnitude,  but  with 
a  small  angular  dispersion.  Show  that  if  the  path  length  /  is  correctly 
chosen,  all  such  particles  are  focussed  onto  a  line  parallel  to  the  z-axis. 
(Thus  a  slit  at  that  point  can  be  used  to  select  particles  with  a  given 
speed.)  For  electrons  of  velocity  108ms_1  in  a  magnetic  field  of  0.02T, 
find  the  required  electric  field,  and  the  correct  (smallest  possible)  choice 
for  I. 

13.  The  angular  velocity  of  the  electron  in  the  lowest  Bohr  orbit  of  the 
hydrogen  atom  is  approximately  4  x  1016s_1.    What  is  the  largest 
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magnetic  field  which  may  be  regarded  as  small  in  this  case,  in  the 
sense  of  §5.5?  Determine  the  Larmor  frequency  in  a  field  of  2  T. 
14.  *The  orbit  of  an  electron  (charge  —  e)  around  a  nucleus  (charge  Ze) 
is  a  circle  of  radius  a  in  a  plane  perpendicular  to  a  uniform  magnetic 
field  B.  By  writing  the  equation  of  motion  in  a  frame  rotating  with 
the  electron,  show  that  the  angular  velocity  u  is  given  by  one  of  the 
roots  of  the  equation 


Verify  that  for  small  values  of  B,  this  agrees  with  §5.5.  Evaluate  the 
two  roots  if  B  =  105  T,  Z  =  1  and  a  =  5.3  x  10-11  m.  (Note,  however, 
that  in  reality  a  would  be  changed  by  the  field.) 
15.  *A  projectile  is  launched  due  north  from  a  point  in  colatitude  9  at  an 
angle  7r/4  to  the  horizontal,  and  aimed  at  a  target  whose  distance  is 
y  (small  compared  to  Earth's  radius  R).  Show  that  if  no  allowance  is 
made  for  the  effects  of  the  Coriolis  force,  the  projectile  will  miss  its 
target  by  a  distance 


Evaluate  this  distance  if  9  =  45°  and  y  =  40  km.  Why  is  it  that  the 
deviation  is  to  the  east  near  the  north  pole,  but  to  the  west  both  on 
the  equator  and  near  the  south  pole?  (Neglect  atmospheric  resistance.) 
16.  *  Solve  the  problem  of  a  particle  falling  from  height  h  above  the  equator 
by  using  an  inertial  frame,  and  verify  that  the  answer  agrees  with  that 
found  using  a  rotating  frame.    (Hint:  Use  equations  (3.48).  Recall 


17.  Find  the  equations  of  motion  for  a  particle  in  a  frame  rotating  with 
variable  angular  velocity  u>,  and  show  that  there  is  another  apparent 
force  of  the  form  —muj  A  r.  Discuss  the  physical  origin  of  this  force. 

18.  Find  the  equation  of  motion  for  a  particle  in  a  uniformly  accelerated 
frame,  with  acceleration  a.  Show  that  for  a  particle  moving  in  a  uniform 
gravitational  field,  and  subject  to  other  forces,  the  gravitational  field 
may  be  eliminated  by  a  suitable  choice  of  a. 

19.  *The  co-ordinates  (x,  y,  z)  of  a  particle  with  respect  to  a  uniformly 
rotating  frame  may  be  related  to  those  with  respect  to  a  fixed  inertial 


raw2  -  eBcu  -  Ze2/4ne0a,i  =  0. 


(cos  9 


Fig.  5.8.) 
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frame,  (x* ,y* ,  z*),  by  the  transformation 


X 

cos  ujt    sin  u>t  0 

~x*~ 

y 

—  sinujt  cos  u>t  0 

y* 

_z_ 

0          0  1 

(Here,  we  use  matrix  notation:  this  stands  for  three  separate  equations, 

x  =  cos  cot  ■  x*  +  s'mujt  ■  y*, 

etc.)  Write  down  the  inverse  relation  giving  (x* ,  y* ,  z*)  in  terms  of 
(x,y,z).  By  differentiating  with  respect  to  t,  rederive  the  relation 
(5.15)  between  d2r/dt2  and  r.  [Hint:  Note  that  r  =  (x,y,z),  while 
d2r/di2  is  the  vector  obtained  by  applying  the  above  transformation 
to  (x*,y*,z*).] 

20.  Another  way  of  deriving  the  equation  of  motion  (5.16)  is  to  use  La- 
grange's equations.  Express  the  kinetic  energy  ^m(dr / dt)2  in  terms  of 
(x,y,z),  and  show  that  Lagrange's  equations  (3.44)  reproduce  (5.16) 
for  the  case  where  the  force  is  conservative. 


Chapter  6 
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This  chapter  is  complementary  to  the  preceding  ones.  In  it  we  shall  discuss 
not  the  problem  of  determining  the  motion  of  a  particle  under  known  forces, 
but  the  problem  of  finding  the  forces  from  a  knowledge  of  the  positions  of 
other  bodies.  We  deal  specifically  with  gravitational  and  electrostatic  forces 
obeying  the  inverse  square  law,  which  are  determined  by  the  positions  of 
other  masses  and  charges. 

We  return  here  to  the  convention  whereby  r  =  dr/dt.  We  shall  not  use 
rotating  frames  again  until  Chapter  9. 

6.1     Gravitational  and  Electrostatic  Potentials 

The  gravitational  potential  energy  of  a  particle  of  mass  m  moving  in  the 
field  of  a  fixed  mass  m!  at  r'  is  —Gmm'/\r  —  r'\.  If  we  have  several  masses 
rrij,  located  at  the  points  rj,  then  the  potential  energy  is  the  sum 


(The  fact  that  the  potential  energies  add  follows  from  the  additive  property 
of  forces.) 

Since  the  mass  m  appears  only  as  an  overall  factor,  we  may  define  the 
gravitational  potential  <t>(r)  to  be  the  potential  energy  per  unit  mass 


V(r)  =  m$(r), 


(6.1) 


so  that 


(6.2) 
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Note  that  $  is  always  negative.  (The  potential  has  sometimes  been  defined 
to  be  minus  the  potential  energy  per  unit  mass,  and  therefore  positive.  We 
prefer,  however,  to  retain  the  direct  correspondence  between  potential  and 
potential  energy,  so  that  particles  tend  to  move  towards  regions  of  lower 
potential.) 

The  acceleration  of  a  particle  moving  under  gravitational  forces  is  given 

by 

mr  =  -W(r)  =  -mV$(r). 

Since  this  acceleration  is  independent  of  the  mass  m,  we  may  define  the 
gravitational  acceleration  or  gravitational  field  g(r)  by 

g(r)  =  -V$(r),  (6.3) 

Tims  (6.2)  and  (6.3)  are  all  that  is  needed  to  calculate  the  acceleration 
induced  in  a  particle  by  a  given  distribution  of  masses. 

The  electrostatic  case  is  very  similar.  We  define  the  electrostatic  poten- 
tial 4>{r)  to  be  the  electrostatic  potential  energy  per  unit  charge, 

V(r)  =  #(r).  (6.4) 

The  potential  due  to  charges  qj  at  r3  is  then 

4>(r)  =  y-  ^  r.  (6.5) 

3  J 

The  acceleration  of  a  charged  particle  is  given  by 

mr  =  qE, 

where  the  electric  field  E(r)  is  defined  by 

E(r)  =  -V<p(r),  (6.6) 

Note  that  the  acceleration  depends  on  the  charge-to-mass  ratio  q/m.  (The 
corresponding  ratio  in  the  gravitational  case  is  the  ratio  of  gravitational  to 
inertial  mass,  which  is  of  course  a  constant,  set  equal  to  unity  in  conven- 
tional units.  This  proportionality  is  the  content  of  the  equivalence  princi- 
ple.) 

Unlike  the  gravitational  potential,  the  electrostatic  potential  <fi  may  have 
either  sign,  because  both  positive  and  negative  charges  exist. 
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We  can  now  forget  about  the  particle  at  r,  and  concentrate  on  the  prob- 
lem of  calculating  the  potential  from  given  information  about  the  positions 
of  the  masses  or  charges. 


6.2    The  Dipole  and  Quadrupole 

The  electric  dipole  consists  of  two  equal  and  opposite  charges,  q  and  —q, 
placed  close  together,  say  at  a  and  at  the  origin,  respectively.  (See  Fig.  6.1.) 
The  potential  is 


Fig.  6.1 


Q 


Q 


4ireo\r  —  a\  47reo|T" 


(6.7) 


We  shall  assume  that  a  «  r,  so  that  we  may  expand  by  the  binomial 
theorem.  Since  we  shall  need  the  result  later,  we  evaluate  the  terms  up  to 
order  a2,  but  neglect  a3.  If  6  is  the  angle  between  a  and  r,  then 

\r  —  a\2  =  r2  —  2ar  cos  6*  +  a2. 


Hence 
1 


-  1 


„  a  „  or 
2-cos#+  -7- 


-1/2 


1 

X"2 


„a  a 
-2- cos9  +  — 


3 

+  - 


a 

-2—  cos( 
r 


+  — 


-  +  ^cos0+^(hoS20-^)  +  ---  .  (6.8) 

(The  general  term  in  this  series  is  of  the  form  (a1  /rl+1)Pi(cosO)7  where 
Pi  is  a  polynomial,  known  as  the  Legendre  polynomial  of  degree  I.  Here, 
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however,  we  shall  only  need  the  terms  up  to  I  =  2.)  In  vector  notation,  we 
may  write  (6.8)  as 

1  1      ar     3(a-r)2-a2r2  .  , 

Tzr^  =  z  +  ^r  +  - — L  +  •••  •  (6-9) 


We  now  return  to  (6.7) .  Keeping  only  the  linear  term  in  a  and  neglecting 
a2,  we  can  write  it  as 

, .  .       d-  r       dcosd  .     , . 

4>  (r)  =  -.  ~  =  -.  ~,  6.10 

where  d  is  the  electric  dipole  moment,  d  =  qa.  The  corresponding  electric 
field,  given  by  (6.6),  has  the  spherical  polar  components  (see  (A. 50)) 

d<j>  2d  cos  9 

dr  4-7reor3 
ld(p  d  sin  6 

Ee  =  -rde      =wf  (  } 

r  sin  C  at/? 

(To  avoid  any  possible  risk  of  confusion,  we  denote  the  potential  and  the 
azimuth  angle  by  the  distinct  symbols  <f>  and  ip.)  This  field  is  illustrated  in 
Fig.  6.2,  in  which  the  solid  lines  are  field  lines,  drawn  in  the  direction  of 
E,  and  the  dashed  lines  are  the  equipotential  surfaces,  <fi  =  constant.  Note 
that  the  two  always  intersect  at  right  angles. 

We  can  repeat  the  process  of  putting  two  charges  (or  monopoles) 
together  to  form  a  dipole,  by  putting  two  dipoles  together  to  form  a 
quadrupole.  If  we  place  oppositely  oriented  dipoles,  with  dipole  moments  d 
and  —  d,  at  a  and  at  the  origin,  then  the  potential  is 

Afr)  =    d  '  (r  ~  Q)    _    d  r 

47reo|r  —  a|3  Aireor3 

Now,  neglecting  terms  of  order  a2,  we  have 


\r  —  a\°  r 
Hence,  to  this  approximation 


1  /        a-rx-3/2      1  „a-r 


3(d-  r)(a  •  r)  -  (d  ■  a)r2 
4-Keor5 


(6.12) 
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Fig.  6.2 


In  the  special  case,  where  the  dipoles  are  placed  end-on,  so  that  d  is 
parallel  to  a,  we  may  take  this  common  direction  to  be  the  z-axis,  and 
obtain 


0(r) 


167re()r3 


(3  cos2  (9-  1), 


(6.13) 


where  Q  is  the  electric  quadrupole  moment,  Q  =  Ada.  (The  factor  of  4 
is  purely  conventional,  and  serves  to  simplify  some  of  the  later  formulae.) 
The  corresponding  electric  field  is 


Er 


Ea 


3Q 


167reof4 
3Q 


(3cos26»-  1), 


87re()r' 


-  cos  U  sin  ( 


(6.14) 


Ev  =  0. 


It  is  illustrated  in  Fig.  6.3. 
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Fig.  6.3 


6.3    Spherical  Charge  Distributions 

When  we  have  a  continuous  distribution  of  charge  (or  mass)  we  can  replace 
the  sum  in  (6.5)  by  an  integral: 

«'>  =  ///i^FbidV'  <6-15> 

where  p(r')  is  the  charge  density,  and  d3r'  denotes  the  element  of  volume. 
In  spherical  polars,  the  volume  element  is  the  product  of  the  elements  of 
length  in  the  three  co-ordinate  directions,  namely  (see  §3.5) 

dV  =  da;'  dy'  dz'  =  r,2dr'  sin  9'd9'  dip'. 

Note  that  6'  ranges  from  0  — >  n,  and  ip'  from  0  — >  2ir. 

We  shall  consider  in  this  section  spherically  symmetric  distributions  of 
charge,  for  which  p  is  a  function  only  of  the  radial  co-ordinate  r' .  Let  us 
first  take  a  uniform  thin  spherical  shell  of  charge  density  p,  radius  a  and 
thickness  da.  Choosing  the  direction  of  r  to  be  the  z-axis,  we  can  write 
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(6.15)  as 

pa2  da  f  f  sin  6'  dO'  dip' 


4tt£o  J  J  (r2  -  2ar  cosd'  +  a2)1/2  ' 

The  iff  integration  gives  a  factor  of  2it.  The  6'  integration  can  easily  be 
performed  by  the  substitution  u  =  cos  9',  and  yields 


(r)  =  pa  da 


2eoar 


We  now  have  to  consider  separately  the  cases  r  >  a  and  r  <  a.  In  terms  of 
the  total  charge  dq  =  4ivpa2da  of  the  spherical  shell,  we  obtain 


dq 


dq 


ATTeoa ' 


r  >  a, 

(6.16) 

r  <  a. 


Outside  the  shell,  the  potential  is  the  same  as  that  of  a  charge  dq  concen- 
trated at  the  origin,  and  the  electric  field  is  E  =  rdq/Aneor2.  Inside,  the 
potential  is  a  constant,  and  the  electric  field  vanishes. 

It  is  now  easy  to  find  the  potential  of  any  spherically  symmetric  distri- 
bution, by  summing  over  all  the  spherical  shells.  It  is  clear  that  the  electric 
field  at  a  distance  r  from  the  centre  is  equal  to  that  of  a  point  charge  lo- 
cated at  the  origin,  whose  magnitude  is  the  total  charge  enclosed  within  a 
sphere  of  radius  r.  (A  similar  result  holds  in  the  gravitational  case.) 

A  simple  example  is  a  uniformly  charged  sphere. 

Example:  Uniformly  charged  sphere 

What  is  the  potential  and  electric  field  for  a  sphere  of  radius  a, 
with  constant  charge  density  p  inside  it? 

For  r  >  a  the  potential  is  clearly  just 

<P(r)  =  ~r~^        with       1  =  t^P- 

Inside  the  sphere,  we  have  to  separate  the  contributions  from  the  regions 
r'  <  r  and  r'  >  r,  and  obtain 


r)=r^dr'+r^dr'. 

JO      e0r  Jr  «0 
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Performing  the  integrations,  we  find 

9 


(r) 


4:ne0r ' 
q  /3 


r  >  a, 


r  <  a. 


(6.17) 


Uvreo  V2a      2«3  J  ' 
This  potential  is  illustrated  in  Fig.  6.4.  The  corresponding  electric  field 


Fig.  6.4 


E{r) 


r  >  a. 


4T2'  (6-18) 
r  <  a. 


Thus,  outside  the  sphere,  the  field  obeys  the  inverse  square  law. 
Inside,  it  increases  linearly  from  zero  at  the  centre  to  the  surface  value 
q/4-Ke0a2. 

So  far  as  its  effect  on  outside  bodies  is  concerned,  any  spherically  sym- 
metric distribution  of  charge  or  mass  may  be  replaced  by  a  point  charge  or 
mass  located  at  the  centre.  This  is  the  justification  for  treating  the  Sun, 
for  example,  as  a  point  mass  in  discussing  the  motion  of  the  planets.  Of 
course,  any  deviation  from  spherical  symmetry  will  lead  to  a  modification 
in  the  inverse  square  law.  We  shall  discuss  the  nature  of  this  correction 
later. 
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6.4    Expansion  of  Potential  at  Large  Distances 

It  is  only  for  simple  cases  that  we  can  calculate  the  potential  exactly.  In 
general,  we  must  use  approximation  methods.  In  particular,  we  are  often 
interested  in  the  form  of  the  potential  at  distances  from  the  charge  or  mass 
distribution  which  are  large  compared  to  its  dimensions.  Then  in  (6.5)  or 
(6.15),  r  is  much  larger  than  rj  or  r' ,  and  we  may  use  the  series  expansion 
(6.8)  to  obtain  an  expansion  of  the  potential  in  powers  of  1/r: 

0(r)  =  Mr)  +  Mr)  +  Mr)  +  ■■■  ■  (6.19) 
The  leading  ('monopole')  term  is 

Mr)  =  ^  (6-20) 

where  q  is  the  total  charge 

<Z  =  X>       or       <?  =  ///  P(r')  dV.  (6.21) 

Thus  at  very  great  distances,  only  the  total  charge  is  important,  not  the 
shape  of  the  distribution.  This  is  the  potential  we  should  get  for  an  exactly 
spherically  symmetric  distribution.  Thus  the  further  terms  in  the  series 
(6.19)  may  be  regarded  as  measuring  deviations  from  spherical  symmetry. 
The  next  term  in  (6.19)  is 

d  ■  r 

where  d  is  the  dipole  moment,  defined  by 


Mr)  =  7—3,  (6-22) 


d  =  J2<bri       or       d  =  jjj  p(r')r' d3r'.  (6.23) 

If  we  shift  the  origin  through  a  distance  R,  the  total  charge  is  clearly 
unaffected,  but  the  dipole  moment  changes  to 

d  =       Qj(rj  -  R)  =  d-qR.  (6.24) 

3 

Thus,  if  the  total  charge  is  non-zero,  we  can  always  make  the  dipole  moment 
vanish  by  choosing  the  origin  to  lie  at  the  centre  of  charge 

R=Z^1=d  (6.25) 
Eft  Q 
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In  this  case,  the  dipole  moment  gives  us  information  about  the  position  of 
the  centre  of  charge.  However,  if  the  total  charge  is  zero,  then  by  (6.24)  the 
dipole  moment  is  independent  of  the  choice  of  origin  —  the  electric  dipole 
and  quadrupole  discussed  in  the  preceding  section  are  examples  of  this. 

Note  that  in  the  gravitational  case,  the  total  mass  can  never  vanish, 
so  we  can  always  ensure  that  the  gravitational  'dipole  moment'  is  zero  by 
choosing  our  origin  at  the  centre  of  mass. 

The  'quadrupole'  term  in  the  potential  (6.19),  obtained  by  substituting 
the  third  term  of  (6.9)  into  (6.15),  is 

Mr)  =  HI  P(r')3{r'rZ~/r'' (6-26) 

One  can  write  this  expression  in  a  form  similar  to  (6.12).  However,  we 
shall  not  consider  the  general  case  here  (see  Appendix  A,  Problem  17), 
but  restrict  the  discussion  to  the  special  case  of  axial  symmetry,  where  p 
depends  on  r'  and  8'  (or  p'  and  z')  but  not  on  tp1 .  This  case  is  of  particular 
interest  in  connection  with  the  gravitational  potential  of  the  Earth,  which  is 
to  a  good  approximation  axially  symmetric,  though  flattened  at  the  poles. 
We  shall  see  that  in  the  case  of  axial  symmetry,  (6.26)  can  be  written  in  a 
form  similar  to  (6.13)  with  a  suitably  defined  quadrupole  moment  Q. 

To  do  this,  we  examine  the  numerator  of  the  integrand  in  (6.26).  Written 
out  in  terms  of  components,  it  reads 

xi(2x'2  -  y'2  -  z'2)  +  y2(2y'2  -  x'2  -  z'2)  +  z2(2z'2  -  x'2  -  y'2) 

+  6xyx'y'  +  6xzx'z'  +  6yzy'z'.  (6.27) 

Now,  because  of  axial  symmetry,  any  integral  involving  an  odd  power  of  x' 
or  y'  will  vanish,  for  example 


///' 


p(r')x'z'  dV  =  0, 

because  the  contributions  from  (x',y',z')  and  (—x',y',z')  exactly  cancel. 
Moreover,  axial  symmetry  also  implies  that 

HI  p(r')x'2  dV  =  m  p(r')y'2  dV, 

since  the  x-  and  y-axes  are  in  no  way  distinguished  from  one  another.  Hence 
we  can  replace  2x'2  —  y'2  —  z'2  in  the  integrand  by  \{x12  +y'2)  —  z'2  without 
affecting  the  value  of  the  integral.  When  we  do  this,  we  see  that  the  three 
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remaining  terms  of  (6.26)  all  involve  the  same  integral,  namely 

Q  =  HI  p(r'){2z12  -  x'2  -  y'2)  dV '.  (6.28) 
Thus  (6.26)  can  be  written 

This  is  identical  with  the  potential  (6.12)  of  an  axially  symmetric 
quadrupole.  The  quantity  Q  defined  by  (6.28)  is  the  quadrupole  moment 
of  the  distribution. 

Example:  Linear  quadrupole 

Two  equal  charges  q  are  positioned  at  the  points  (0,  0,  ±a),  and 
a  charge  —  2q  at  the  origin.  Find  the  quadrupole  moment,  and 
write  down  the  potential  and  electric  field  at  large  distances. 

Here  the  central  particle  contributes  nothing  to  Q,  and  the  other  two 
each  give  the  same  contribution,  so  Q  =  2q(2a2)  =  Aqa2.  The  potential 
and  electric  field  at  large  distances  are  given  by  (6.13)  and  (6.14)  with 
this  value  of  Q. 

Thus  we  see  that  a  better  approximation  than  treating  a  charge  distri- 
bution as  a  single  point  charge  is  to  treat  it  as  a  point  charge  plus  an  electric 
quadrupole  of  moment  given  by  (6.28).  For  a  spherically  symmetric  charge 
distribution,  Q  =  0,  because  the  integrals  over  x'2,  y'2  and  z'2  all  yield  the 
same  value.  The  value  of  Q  may  be  regarded  as  a  measure  of  the  flattening 
of  the  distribution.  For  a  distribution  with  uniform  positive  charge  density, 
Q  is  positive  if  the  shape  is  prolate  (egg-shaped,  with  the  z-axis  longer  than 
the  others),  and  negative  if  it  is  oblate,  like  the  Earth.  In  particular,  for 
a  spheroid  (ellipsoid  of  revolution),  with  uniform  charge  density  and  semi- 
axes  a,  a,  c,  we  can  evaluate  the  integral  in  (6.28)  explicitly.  We  shall  have 
occasion  to  evaluate  a  very  similar  integral  in  connection  with  moments  of 
inertia  in  Chapter  9,  and  therefore  omit  the  details  here.  The  result  is 


Q  =  lq(c2  -  a2). 


(6.30) 
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6.5    The  Shape  of  the  Earth 

The  Earth  is  approximately  an  oblate  spheroid  whose  equatorial  radius  a 
exceeds  its  polar  radius  c  by  about  21.4  km.  Its  oblateness  is  defined  to  be 

e=iZ£„    1  (6.31) 
a        297  y  ' 

Its  gravitational  potential  is  therefore  not  precisely  the  inverse-square-law 
potential,  —GM/r.  The  most  important  correction  is  the  'quadrupole' 
term, 

&2(r)  =  -^(3cos20-l). 

Since  the  Earth  is  oblate,  Q  here  is  actually  negative. 

If  the  density  of  the  Earth  were  uniform,  the  quadrupole  moment  would 
be  given  simply  by  (6.30): 

Q  =  -§M(a2  -  c2)  «  -§Ma2e. 

Here  we  have  used  c  =  a(l  —  e),  and  dropped  terms  of  order  e2.  In  fact, 
the  density  is  considerably  greater  near  the  centre  than  at  the  surface,  so 
that  large  values  of  r'  contribute  proportionately  less  to  the  integral  (6.28). 
Thus  we  must  expect  Q  to  be  rather  smaller  in  magnitude  than  this  value 
(see  Problem  15).  It  will  be  convenient  to  write 

Q  =  -2Ma2J2, 

where  J2  is  a  dimensionless  parameter,  somewhat  less  than  |e,  defining 
the  magnitude  of  the  quadrupole  moment.  (In  general,  J\  parametrizes  the 
term  in  $  proportional  to  Pi(cosO)  —  see  the  note  following  Eq.  (6.8).) 
Then  the  gravitational  potential  becomes 

$(r)«  —  +  — ^3-^ (3  cos2  (9  -  l).  (6.32) 

We  shall  see  that  both  e  and  J2  can  be  determined  from  measurements  of 
the  acceleration  due  to  gravity  and  the  use  of  an  equilibrium  principle. 

The  flattening  of  the  Earth  is  in  fact  a  consequence  of  its  rotation. 
Over  long  periods  of  time,  the  Earth  is  capable  of  plastic  deformation,  and 
behaves  more  nearly  like  a  liquid  than  a  solid  —  and  of  course  the  larger- 
part  of  the  surface  is  covered  by  a  liquid.  Although  its  crust  does  have 
some  rigidity,  it  could  not  preserve  its  shape  for  long  unless  it  were  at 
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least  approximately  in  equilibrium  under  the  combined  gravitational  and 
centrifugal  forces. 

Now,  as  in  the  problem  discussed  at  the  end  of  §5.3,  the  surface  of  a 
liquid  in  equilibrium  under  conservative  forces  must  be  an  equipotential 
surface,  for  otherwise  there  would  be  a  tendency  for  the  liquid  to  flow 
towards  regions  of  lower  potential.  We  can  include  the  centrifugal  force  by 
adding  to  the  potential  a  term 

$cent(r)  =  -  |^V  +V2)  =  -  ^Vsin2^. 

The  equipotential  equation  is  then 

$(r)  +  $cent(r)  =  constant,  (6.33) 

with  $  given  by  (6.32).  Thus  equating  the  values  at  the  pole  (8  =  0)  and 
on  the  equator  (6  =  tt/2),  we  obtain  the  equation 


GM     GMa2J2  _     GM     GMJ2  1 


2a 


2  2 

u>  a  . 


Since  e  and  J2  are  both  small  quantities,  we  may  neglect  e2  and  ej2.  Thus, 
in  the  quadrupole  term,  we  may  ignore  the  difference  between  c  and  a.  In 
the  inverse-square-law  term,  we  can  write  1/c  «  (l/a)(l  +  e).  Hence  to  first 
order  we  find 

GMe     3GMJ2        ,   ,  , 

 +  — ^  =  -3W  a  • 

a  2a  z 

Writing  go  for  the  gravitational  acceleration  without  the  centrifugal  term, 
go  =  GM/a2,  and  multiplying  by  2 /ago,  we  obtain 

2 

—  =  2e  -  3J2.  (6.34) 

go 

If  we  made  the  approximation  of  treating  the  Earth  as  of  uniform  den- 
sity, and  therefore  set  J2  =  |e,  then  this  equation  would  determine  the 
oblateness  in  terms  of  the  angular  velocity.  This  yields  e  «  1/230,  which  is 
appreciably  larger  than  the  observed  value. 

On  the  other  hand,  if  we  regard  e  and  J2  as  independent,  then  we  need 
another  relation  to  fix  them  both.  Such  a  relation  can  be  found  from  the 
measured  values  of  g.  (See  §5.3.)  Now,  the  gravitational  field  g  =  —  V<1> 
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corresponding  to  the  potential  (6.32)  is 

GM     SGMa2J2.n     ,  „ 
9r  =  ~-^+      2r4  (3cos2e-l), 

r    0        2r  (6.35) 
3GMa2J2  K  ' 

ge  =  j  cos  6>  sin  6>. 

Note  that  50  is  directed  away  from  the  poles,  and  towards  the  equator, 
as  one  might  expect  from  thinking  of  it  as  due  to  the  attraction  of  the 
equatorial  bulge.  The  inward  radial  acceleration  —  gr  is  decreased  at  the 
poles,  and  increased  at  the  equator,  by  the  quadrupole  term,  so  that  the 
difference  between  the  two  values  is  not  as  large  as  the  inverse  square  law 
would  predict. 

At  the  poles,  and  on  the  equator,  gg  =  0,  and  the  magnitude  of  g  is 
equal  to  —  gr.  Thus  we  find 

GM     3GMa2J2  „  , 

ffpoie  =   —A         «  ffo(l  +  2e  -  3J2) 


GM  3GMJ2 
<?eq  =  —  +     2fl2        w  ffo(l  +  2^ 


^2  J 


making  the  same  approximations  as  before.  Thus,  for  the  difference,  we 
find  the  expression 

Ag  =  gpole-fleq=2£_9j2 

go         go  2 

For  the  value  including  the  centrifugal  term,  Ag*  =  Ag  +  ui2a,  we  have  to 
add  (6.34)  to  (6.36).  This  yields 

Ag*      a       15  T  la  vr\ 

—  =4e-— J2.  (6.37) 

go  * 

Using  the  known  values  quoted  in  (5.18)  and  (5.19),  we  may  solve  the 
simultaneous  equations  (6.34)  and  (6.37)  for  e  and  J2.  This  yields 

e  =  0.0034,        J2  =  0.0011. 

The  value  of  e  is  in  good  agreement  with  the  the  oblateness  deduced  from 
direct  measurements  of  the  polar  and  equatorial  radii.  As  expected,  the 
value  of  J2  is  somewhat  less  than  |e.  The  relation  between  the  two  is 
determined  by  the  density  distribution  within  the  Earth.  (See  Problem 
15.) 

One  consequence  of  the  fact  that  the  Earth's  surface  is  approximately 
an  equipotential  surface  should  be  noted.  Because  of  the  general  property 
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that  the  force  field  is  always  perpendicular  to  the  equipotentials,  it  means 
that  the  effective  g* ,  which  is  the  field  derived  from  $  +  3>Cent>  is  always 
perpendicular  to  the  Earth's  surface.  Thus,  although  a  plumb  line  does  not 
point  towards  the  Earth's  centre,  it  is  perpendicular  to  the  surface  at  that 
point  —  apart,  of  course,  from  minor  deviations  caused  by  mountains  and 
the  like.  (Compare  Chapter  1,  Problem  10.) 

Satellite  orbits 

The  'quadrupole'  term  in  the  Earth's  gravitational  field  has  two  important 
effects  on  the  orbit  of  a  close  artificial  satellite.  Indeed,  observations  of 
such  orbits  provide  the  most  reliable  means  of  measuring  its  magnitude 
(and  those  of  the  even  smaller  higher-order  correction  terms).  They  have 
revealed  quite  substantial  deviations  from  spheroidal  shape  and  from  hy- 
drostatic equilibrium:  the  Earth's  surface  is  only  rather  approximately  an 
equipotential. 

The  first  effect  of  the  quadrupole  term,  arising  mainly  from  the  deviation 
of  the  radial  component  of  g  from  the  inverse  square  law,  is  a  precession  of 
the  major  axis  of  the  orbit  within  the  orbital  plane.  (Compare  Chapter  4, 
Problem  27.)  The  major  axis  precesses  in  the  forward  direction  for  orbits  of 
small  inclination  to  the  equator,  and  in  the  retrograde  direction  for  orbits 
with  inclination  greater  than  arcsin  ^/4/5  =  63.4°.  (The  difference  is  a 
reflection  of  the  differing  sign  of  the  quadrupole  contribution  to  gr  on  the 
equator  and  near  the  poles.) 

The  second  effect  occurs  because  the  force  is  no  longer  precisely  central, 
so  that  the  angular  momentum  changes  with  time  according  to 

^  =  mr  A  g.  (6.38) 

Tims,  as  in  the  case  of  the  Larmor  effect  discussed  in  §5.6,  the  orbital  plane 
precesses  around  the  direction  of  the  Earth's  axis.  The  rate  of  precession 
may  be  calculated  by  a  method  very  similar  to  the  one  used  there  (see 
Problem  25).  The  precession  is  in  fact  greater  for  orbits  of  small  inclination, 
and  is  zero  for  an  orbit  passing  over  the  poles.  For  a  close  satellite  with 
small  inclination,  it  can  be  nearly  10°  per  day.  It  is  always  in  a  retrograde 
sense  (opposite  to  the  direction  of  revolution  of  the  satellite  in  its  orbit). 

Both  effects  are  strongly  dependent  on  the  radius  of  the  satellite  orbit. 
In  fact,  the  rate  of  precession  decreases  like  r-7/2.  At  the  radius  of  the 
Moon's  orbit,  the  precessional  angular  velocity  is  only  a  few  seconds  of  arc 
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per  year.  (The  Moon's  orbit  does  precess,  at  about  19°  per  year,  but  this 
is  a  consequence  of  the  non-uniformity  of  the  Sun's  gravitational  field,  not 
of  the  shape  of  the  Earth.  See  Problem  26.) 

Planetary  precession 

It  should  be  noted  that  a  small  oblateness  of  the  Sun  produces  a  slow 
precession  of  the  Kepler  orbits  around  it. 

The  major  part  of  the  precession  of  planetary  orbits  is  due  to  the  gravi- 
tational action  of  the  planets  on  each  other,  leading  for  example  in  the  case 
of  Mercury  to  a  predicted  precession  of  the  axes  of  the  ellipse  of  about  531 
seconds  of  arc  every  century.  These  effects  can  be  modelled  quite  well  for 
each  planet  by  smearing  the  mass  of  each  of  the  other  planets  into  an  ap- 
propriate coplanar  uniform  circular  ring.  The  modifications  to  the  inverse 
square  law  potential  depend,  of  course,  on  whether  the  planetary  distance 
being  considered  is  inside  or  outside  the  ring. 

For  Mercury  (alone)  there  is  an  observed  residual  precession  of  43"  per 
century,  not  explainable  by  the  Newtonian  theory.  In  fact,  general  relativity 
does  explain  this  precession  —  indeed  that  it  does  so  is  one  of  the  celebrated 
tests  of  the  theory. 


6.6    The  Tides 

Tidal  forces  arise  because  the  gravitational  attraction  of  the  Moon,  and  to 
a  lesser  extent  of  the  Sun,  is  not  uniform  over  the  surface  of  the  Earth. 
The  attraction  is  stronger  than  average  on  the  side  of  the  Earth  facing  the 
Moon,  and  weaker  than  average  on  the  far  side,  so  there  is  a  tendency  for 
the  Earth  to  be  elongated  along  the  line  of  centres. 

Let  a  be  the  position  of  the  Moon  relative  to  the  Earth's  centre,  and 
consider  a  point  r  on  the  Earth.  The  potential  at  this  point  is  (see  Fig.  6.5) 


$(r) 


Gm 


(6.39) 


r  — 


a 


where  m  is  the  mass  of  the  Moon.  Now,  since  r  -C  a,  we  may  expand  in 
powers  of  r/a.  (This  is  the  reverse  of  the  situation  encountered  in  §6.2, 
where  r  was  much  larger  than  a.)  Taking  the  direction  of  the  Moon  to  be 
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r 

1  a  ' 

Fig.  6.5 


the  ^-direction,  and  using  (6.8),  we  obtain 


<J>(r)  =  -Gm 


2 


-  +         COs6»  +  ^r(§  COS2  6  -  §)  + 


The  first  term  is  a  constant  and  does  not  yield  any  force.  The  second  may 
be  written 

$i  r  =  Tz- 

It  yields  a  uniform  gravitational  acceleration  Gm/a2  directed  towards  the 
Moon.  This  term  therefore  describes  the  major  effect  of  the  Moon's  gravi- 
tational force,  which  is  to  accelerate  the  Earth  as  a  whole.  It  is  irrelevant  in 
discussing  the  phenomenon  of  the  tides,  since  we  are  interested  in  motion 
relative  to  the  centre  of  the  Earth.  In  fact,  we  can  eliminate  it  by  going 
over  to  an  accelerated  frame,  the  frame  in  which  the  Earth's  centre  of  mass 
is  at  rest.  (See  Chapter  5,  Problem  18.) 

The  important  term  for  our  purposes  is  the  quadratic  term,  which  leads 
to  a  gravitational  field 

Gmr  .       9  „  . 
gr  =  -^-(3cos26»-  1), 

°  (6-40) 

ge  =  5 —  cos  0  sin  0. 

This  field  is  directed  outwards  along  the  z-axis,  towards  and  away  from  the 
Moon,  and  inwards  in  the  xy-plane,  as  expected. 

It  should  be  noted  that  the  field  described  by  (6.40)  is  much  weaker 
even  than  the  corrections  to  the  Earth's  field  discussed  in  the  preceding 
section.  (If  it  were  not,  it  would  show  up  in  measurements  of  g.)  It  is  easy 
to  compute  its  magnitude.  It  is  smaller  than  go  =  GM/r2  by  the  factor 
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mr3/Ma3.  For  the  Moon,  m/M  =  1/81.3  and  r/a  =  1/60.3,  so  that  this 
factor  is 

771 

— —  =  5.60  x  10"8.  (6.41) 

Ma3  v  ; 

If  we  denote  the  mass  of  the  Sun  by  m!  and  its  mean  distance,  the  semi- 
major  axis  of  the  Earth's  orbit,  by  a',  then  the  corresponding  values  for 
the  Sun  are  m'/M  =  3.33  x  105  and  r/a'  =  4.26  x  10"5.  Thus  the  factor 
in  that  case  is 

=  2-57  x  10-*.  (6.42) 

By  a  remarkable  coincidence  (unique  in  the  solar  system),  these  two  fields 
are  of  roughly  the  same  order  of  magnitude:  the  effect  of  the  Sun  is  rather 
less  than  half  that  of  the  Moon. 

The  only  reason  why  these  very  small  fields  can  lead  to  significant  effects 
is  that  they  change  with  time.  As  the  Earth  rotates,  the  value  of  0  at  any 
point  on  its  surface  varies.  Thus  the  field  described  by  (6.40)  oscillates 
periodically  with  time.  Because  of  the  symmetry  of  (6.40)  in  the  central 
plane,  the  main  term  has  an  oscillation  period  of  12  hours  in  the  case  of 
the  Sun,  and  slightly  more  for  the  Moon,  because  of  its  changing  position. 
This  explains  the  most  noticeable  feature  of  the  tides  —  their  twice-daily 
periodicity.  Unless  the  Moon  is  directly  over  the  equator,  however,  a  point 
on  the  Earth  that  passes  directly  beneath  it  will  not  also  pass  directly 
opposite,  so  that  the  two  daily  tides  may  be  of  unequal  height.  In  other 
words,  there  is  an  additional  term  in  the  field  with  a  period  of  24  rather 
than  12  hours. 

At  new  moon  or  full  moon,  the  Sun  and  Moon  are  acting  in  the  same 
direction,  and  the  tides  are  unusually  high;  these  are  the  spring  tides.  On 
the  other  hand,  at  the  first  and  third  quarters,  when  the  Sun  and  Moon 
are  at  right  angles  to  each  other,  their  effects  partially  cancel,  and  we  have 
the  relatively  low  neap  tides.  (See  Fig.  6.6.)  Measurements  of  the  relative 
heights  of  the  tides  at  these  times  provided  one  of  the  earliest  methods 
of  estimating  the  mass  of  the  Moon.  (A  more  accurate  method  will  be 
discussed  in  the  next  chapter.) 

To  get  some  idea  of  the  order  of  magnitude  of  the  effects  produced  by 
the  tidal  field,  (6.40),  let  us  make  the  simplest  possible  assumptions.  We 
consider  a  perfectly  rigid  solid  Earth,  completely  covered  by  ocean,  and 
suppose  that  the  natural  periods  of  oscillation  are  short  compared  to  the 
rotation  period.   In  that  case,  the  rotation  is  slow  enough  for  the  water 
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(a)  Spring  tides  (b)  Neap  tides 


Fig.  6.6 

to  reach  equilibrium  under  the  combined  forces  of  Earth  and  Moon.  The 
problem  is  then  one  in  hydrostatics  rather  than  hydrodynamics. 

In  equilibrium,  the  surface  of  the  water  must  be  an  equipotential  sur- 
face. We  can  thus  calculate  the  height  h(9)  through  which  it  is  raised,  as 
a  function  of  the  angle  8  to  the  Moon's  direction  (see  Fig.  6.5).  Since  h(9) 
is  certainly  small,  the  change  in  the  Earth's  gravitational  potential  is  ap- 
proximately goh(6).  This  change  must  be  balanced  by  the  potential  due  to 
the  Moon,  so  that 

Thus,  using  go  =  GM/r2,  we  find 

h{6)  =  fo0(§  cos2  0  ~  §)>       where  =  jj-j •  (6.43) 

This  ho  is  just  (6.41)  times  an  extra  factor  of  r.  Thus,  using  r  =  6370  km, 
we  find,  for  the  Moon,  foo  =  0.36  m,  and  for  the  Sun,  h'0  =  0.16  m. 

These  figures  must  not  be  taken  as  more  than  rough  order-of-magnitude 
estimates  of  the  height  of  the  tides.  Even  on  a  completely  ocean-covered 
Earth,  there  would  be  important  modifying  effects.  First,  we  have  neglected 
the  gravitational  attraction  of  the  ocean  itself.  The  tidal  bulges  exert  an 
attraction  that  tends  to  increase  ho  slightly,  in  fact  by  about  12  per  cent 
(see  Problem  19).  More  important,  however,  is  the  fact  the  Earth  is  not 
perfectly  rigid,  and  is  itself  distorted  by  tidal  forces.  Because  the  observed 
tidal  range  refers  not  to  absolute  height  but  to  the  relative  heights  of  sea 
surface  and  sea  floor,  this  reduces  the  effective  height  by  a  substantial  factor. 
Indeed,  in  the  extreme  case  of  a  fluid  Earth  enclosed  by  a  completely  flexible 
crust,  there  would  be  essentially  no  observable  tide  at  all. 
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The  values  obtained  from  (6.43)  may  seem  rather  small,  especially  when 
further  reduced  by  the  effect  just  mentioned.  However,  it  is  important  to 
remember  that  they  refer  to  the  unrealistic  case  of  an  Earth  without  con- 
tinents. The  observed  tidal  range  in  mid-ocean  (which  can  be  measured 
by  ranging  from  satellites)  is  in  fact  quite  small,  normally  less  than  a  me- 
tre. Large  tides  are  a  feature  of  continental  shelf  areas,  and  are  strongly 
dependent  on  the  local  topography.  Note  too  that  the  Moon's  attraction 
at  any  point  on  the  Earth's  surface  is  a  periodic  force,  whose  effect  may 
be  greatly  enhanced  by  the  phenomenon  of  resonance,  discussed  in  §2.6. 
The  natural  periods  of  oscillation  of  a  body  of  water  depend  on  a  variety  of 
factors,  including  its  size,  shape  and  depth,  and  there  are  critical  values  for 
which  one  of  the  periods  is  close  to  12  hours.  Then,  because  the  damping 
produced  by  tidal  friction  is  normally  quite  small  except  in  very  shallow 
water,  very  large  tides  can  be  set  up. 

Another  effect  of  resonance  is  to  delay  the  oscillations,  so  that  they  are 
not  precisely  in  phase  with  the  applied  force.  At  resonance,  the  phase  lag 
is  a  quarter  period  (about  three  hours  in  this  case).  Thus  we  should  not 
normally  expect  the  times  of  high  tide  to  coincide  with  the  times  when 
the  Moon  is  overhead,  or  directly  opposite.  In  practice,  the  times  are 
determined  in  a  very  complicated  way  by  the  actual  shapes  of  the  oceans. 

6.7    The  Field  Equations 

It  is  often  convenient  to  obtain  the  electric  or  gravitational  potential  by 
solving  a  differential  equation  rather  than  performing  an  integration.  Al- 
though we  shall  not  in  fact  need  to  use  this  technique  in  this  book,  we 
include  a  short  discussion  of  it  because  of  its  very  important  role  in  more 
advanced  treatments  of  mechanics  and  electromagnetic  theory. 

To  find  the  relevant  equations,  we  consider  first  a  single  charge  q  located 
at  the  origin.  The  electric  field  is  then 


Now  let  us  consider  a  closed  surface  S  surrounding  the  charge,  and  evaluate 
the  surface  integral  of  the  component  of  E  normal  to  the  surface, 


E  = 


47reor2 


r. 
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where  dS  is  the  element  of  surface  area,  and  n  is  a  unit  vector  normal  to 
the  surface,  directed  outwards  (see  Fig.  6.7).  If  a  is  the  angle  between  f 


and  n,  then  r  ■  n  dS  =  dS  cos  a  is  the  projection  of  the  area  dS  on  a  plane 
normal  to  the  radius  vector  r.  Hence  it  follows  from  the  definition  of  solid 
angle  (see  (4.42))  that  r  ■  ndS/r2  is  equal  to  the  solid  angle  df2  subtended 
at  the  origin  by  the  element  of  area  dS.  Hence 


since  the  total  solid  angle  subtended  by  a  closed  surface  around  the  origin 
is  4tv.  (It  is  to  ensure  that  the  factor  of  4ir  here  cancels  that  Coulomb's  law 
in  SI  units  is  written  with  an  explicit  factor  of  1/47T.) 

We  have  implicitly  assumed  that  each  radial  line  cuts  the  surface  only 
once.  However,  we  can  easily  remove  this  restriction.  Evidently,  a  radial  line 
must  always  cut  it  in  an  odd  number  of  points.  (See  Fig.  6.8.)  Moreover,  the 
contributions  to  the  surface  integral  from  points  where  it  goes  into,  rather 
than  out  of,  the  surface  are  all  of  opposite  sign  (because  r  ■  n  =  cos  a  is 
negative  at  these  points)  and  of  equal  magnitude.  Thus  all  the  terms  but 
one  will  cancel,  and  we  obtain  the  same  answer  (6.44). 

Similarly,  if  we  consider  a  surface  S  which  does  not  enclose  the  origin, 
we  can  see  that  the  surface  integral  will  vanish.  For  each  radial  line  cuts  S 
in  an  even  number  of  points,  half  in  each  direction,  so  that  all  contributions 
will  cancel. 


Fig.  6.7 


(6.44) 
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Fig.  6.8 


It  is  now  clear  how  we  may  generalize  this  result  to  an  arbitrary  distri- 
bution of  charges.  Any  charge  q  located  anywhere  within  a  closed  surface  S 
will  contribute  to  the  surface  integral  an  amount  q/eo,  while  charges  outside 
contribute  nothing.  Therefore,  the  value  of  the  integral  is  1  /cq  times  the  to- 
tal charge  enclosed  within  the  surface.  If  we  have  a  continuous  distribution 
of  charge,  with  charge  density  p(r),  then 


where  V  is  the  volume  enclosed  by  the  surface  S.  This  is  often  called 
Gauss's  law. 

Now,  by  the  theorem  of  Gauss  (see  (A. 36)),  the  surface  integral  is  equal 
to  the  volume  integral  of  the  divergence  of  E: 


This  equation  must  be  true  for  an  arbitrary  volume  V,  and  therefore  the 
integrand  itself  must  vanish: 


(6.45) 


Hence  (6.45)  may  be  written 


V-E  =  e0"  V 


(6.46) 
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This  is  the  differential  form  of  Gauss's  law.  The  electric  field  is  in  fact  com- 
pletely determined  (except  for  a  possible  additive  uniform  field  extending 
over  the  whole  of  space)  by  this  equation  together  with  the  condition  that 
it  be  a  conservative  field, 

V  A  E  =  0.  (6.47) 

This  latter  equation  guarantees  the  existence  of  the  potential  cf>  related  to 
E  by  E  =  — V0.  (See  §6.6.)  Substituting  in  (6.46),  we  obtain  Poisson's 
equation  for  the  potential, 

V20  =  -eo  V  (6.48) 

The  main  importance  of  these  equations  lies  in  the  fact  that  they  pro- 
vide local  relations  between  the  potential  or  field  and  the  charge  density. 
The  expression  (6.15)  for  <j>  in  terms  of  p  is  non-local,  in  the  sense  that  it 
expresses  the  potential  <f>  at  one  point  as  an  integral  involving  the  charge 
density  at  all  points  of  space.  To  use  this  equation,  we  must  know  p  every- 
where. Frequently,  however,  we  are  interested  in  the  field  in  some  restricted 
region  of  space,  where  we  do  know  the  value  of  p,  while,  instead  of  having 
information  about  p  outside  this  region,  we  have  some  conditions  on  the 
fields  at  the  boundary.  This  type  of  problem  may  be  solved  by  looking 
for  solutions  of  (6.48)  with  appropriate  boundary  conditions.  A  particu- 
larly important  special  case  is  that  in  which  p  vanishes  inside  the  region  of 
interest.  Then  (6.48)  becomes  Laplace's  equation, 

V2tj)  =  0.  (6.49) 

The  gravitational  case  is  of  course  entirely  similar.  The  field  equations 

are 

V  •  g  = -47TGP,  (6.50) 

where  p  is  now  the  mass  density,  and 

VAg  =  0.  (6.51) 

Note  the  explicit  appearance  of  the  factor  of  4-zr  on  the  right  hand  side  of 
(6.50),  which  arises  because  the  gravitational  force  is  conventionally  ex- 
pressed in  terms  of  unrationalized  units,  i.e.,  there  is  no  factor  of  1/Att  in 
(1.5).  Note  also  the  differing  signs  in  (6.46)  and  (6.50),  which  again  reflect 
the  difference  between  (1.5)  and  (1.6). 
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Poisson's  equation  for  the  gravitational  potential  is 

V2$  =  AirGp.  (6.52) 

6.8  Summary 

The  great  advantage  of  calculating  the  potential,  rather  than  the  field  itself 
directly,  is  that  it  is  much  easier  to  add  scalar  quantities  than  vector  ones. 
Even  in  simple  cases,  like  that  of  the  uniformly  charged  sphere,  a  direct 
integration  of  the  forces  produced  by  each  element  of  the  sphere  is  difficult 
to  perform.  In  fact,  the  introduction  of  the  potential  was  one  of  the  major 
advances  in  mechanics,  which  allowed  many  previously  intractable  problems 
to  be  handled  relatively  simply. 

This  chapter,  in  which  we  have  discussed  the  method  of  determining 
the  force  on  a  particle  from  a  knowledge  of  the  positions  of  other  masses  or 
charges,  is  complementary  to  the  previous  chapters,  where  we  discussed  the 
motion  of  a  particle  under  known  forces.  Together,  they  provide  a  method 
of  solving  most  of  the  problems  in  which  the  object  of  immediate  interest 
is  (or  may  be  taken  to  be)  a  single  particle,  moving  under  the  action  of 
conservative  forces. 


Problems 

1.  Find  the  potential  and  electric  field  at  points  on  the  axis  of  symmetry 
of  a  uniformly  charged  fiat  circular  disc  of  charge  q  and  radius  a.  What 
happens  to  the  field  if  we  keep  the  charge  per  unit  area  a  =  q/wa2 
fixed,  and  let  a  — >  oo? 

2.  Calculate  the  quadrupole  moment  of  the  disc  of  Problem  1,  and  deduce 
the  form  of  the  field  at  large  distance  in  any  direction.  Verify  that  on 
the  axis  it  agrees  with  the  exact  value  when  r/a  is  sufficiently  large. 

3.  Write  down  the  potential  energy  of  a  pair  of  charges,  q  at  a  and  —  q 
at  the  origin,  in  a  field  with  potential  4>(r).  By  considering  the  limit 
a  — >  0,  show  that  the  potential  energy  of  a  dipole  of  moment  d  is 
V  =  —d  ■  E.  If  the  electric  field  is  uniform,  when  is  this  potential 
energy  a  minimum?  Show  that  the  dipole  experiences  a  net  moment, 
or  couple,  G  =  d  A  E,  and  that  in  a  non-uniform  field  there  is  also  a 
net  force,  F  =  (d  ■  'V)E.  (Take  d  in  the  z-direction,  and  show  that 
F  =  ddE/dz.) 
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4.  *Write  the  electric  field  of  a  dipole  in  vector  notation.  Using  the  result 
of  Problem  3,  find  the  potential  energy  of  a  dipole  of  moment  d  in  the 
field  of  another  dipole  of  moment  d! .  (Take  d'  at  the  origin  and  d  at 
position  r.)  Find  the  forces  and  couples  acting  between  the  dipoles 
if  they  are  placed  on  the  z-axis  and  (a)  both  are  pointing  in  the  z- 
direction,  (b)  both  are  pointing  in  the  ^-direction,  (c)  d  is  in  the  z- 
direction,  and  d'  in  the  ^-direction,  and  (d)  d  is  in  the  a;-direction  and 
d'  in  the  y-direction. 

5.  Show  that  the  work  done  in  bringing  two  charges  q\  and  <?2,  initially  far 
apart,  to  a  separation  rn  is  qiqij  ^t^ryi-  Write  down  the  correspond- 
ing expression  for  a  system  of  many  charges.  Show  that  the  energy 
stored  in  the  charge  distribution  is 

V  =  \  53  9j(Ai(T*j)) 

3 

where  <f>j(rj)  is  the  potential  at  rj  due  to  all  the  other  charges.  Why 
does  a  factor  of  \  appear  here,  but  not  in  the  corresponding  expression 
for  the  potential  energy  in  an  external  potential  (f>(r)? 

6.  Find  the  energy  stored  in  a  sphere  of  charge  q  and  radius  a  with  uniform 
charge  density,  and  show  that  infinite  energy  is  required  to  compress 
the  sphere  to  a  point.  Find  also  the  stored  energy  in  the  case  where 
the  charge  is  uniformly  spread  over  the  surface  of  the  sphere. 

7.  Near  the  Earth's  surface,  there  is  normally  a  vertical  electric  field,  typ- 
ically of  order  lOOVm-1.  Find  the  corresponding  total  charge  on  the 
Earth,  and  the  stored  energy  per  unit  surface  area. 

8.  Find  the  quadrupole  moment  of  a  distribution  of  charge  on  the  surface 
of  a  sphere  of  radius  a  with  surface  charge  density  a  =  <to(§  cos2  9—  |). 
Find  the  total  energy  stored  in  this  distribution. 

9.  *Two  equal  charges  q  are  located  at  the  points  (±a,0,0),  and  two 
charges  —  q  at  (0,±a,0).  Find  the  leading  term  in  the  potential  at 
large  distances,  and  the  corresponding  electric  field. 

10.  Find  the  gravitational  potential  at  large  distances  of  a  thin  circular 
loop  of  radius  a  and  mass  m,  up  to  terms  of  order  r-3.  Find  also  the 
potential,  and  the  leading  term  in  the  gravitational  field  near  the  origin, 
at  distances  r  a. 

11.  *Six  equal  point  masses  m  are  located  at  the  points  ±ai,  ±aj  and 
±afc.  Show  that  the  quadrupole  term  in  the  potential  vanishes,  and 
find  the  leading  correction  to  the  monopole  term  —QGm/r.  {Note:  This 
requires  expansion  of  the  potential  up  to  terms  of  order  a4/r5.) 
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12.  A  diffuse  spherical  cloud  of  gas  of  density  p  is  initially  at  rest,  and 
starts  to  collapse  under  its  own  gravitational  attraction.  Find  the  radial 
velocity  of  a  particle  which  starts  at  a  distance  a  from  the  centre  when  it 
reaches  the  distance  r.  Hence,  neglecting  other  forces,  show  that  every 
particle  will  reach  the  centre  at  the  same  instant,  and  that  the  time 
taken  is  1v/37r/32pG.  Evaluate  this  time  in  years  if  p  =  10-19kgm-3. 
[Hint:  Assume  that  particles  do  not  overtake  those  that  start  nearer 
the  centre.  Verify  that  your  solution  is  consistent  with  this  assumption. 
The  substitution  r  =  asm2  9  may  be  used  to  perform  the  integration.) 
Estimate  the  collapse  time  for  the  Earth  and  for  the  Sun,  taking  a 
suitable  value  of  p  in  each  case. 

13.  Given  that  the  mass  of  the  gas  cloud  of  Problem  12  is  1030  kg,  and 
that  its  contraction  is  halted  by  the  build-up  of  pressure  when  a  star 
of  radius  10  km  has  been  formed,  use  the  result  of  Problem  6  to  find 
the  total  energy  released,  assuming  that  the  density  of  the  star  is  still 
uniform. 

14.  The  rotation  period  of  Jupiter  is  approximately  10  hours.  Its  mass  and 
radius  are  318Me  and  11.0i?E,  respectively  (E  =  Earth).  Calculate 
approximately  its  oblateness,  neglecting  the  variation  in  density.  (The 
observed  value  is  about  1/15.) 

15.  *  Assume  that  the  Earth  consists  of  a  core  of  uniform  density  pc,  sur- 
rounded by  a  mantle  of  uniform  density  pm,  and  that  the  boundary 
between  the  two  is  of  similar  shape  to  the  outer  surface,  but  with  a 
radius  only  three- fifths  as  large.  Find  what  ratio  of  densities  pc/ 'pm  is 
required  to  explain  the  observed  quadrupole  moment.  (Hint:  Treat  the 
Earth  as  a  superposition  of  two  ellipsoids  of  densities  pm  and  pc  —  pm. 
Note  that  in  reality  neither  core  nor  mantle  is  of  uniform  density.) 

16.  The  distance  between  the  Earth  and  the  Moon  is  gradually  increasing 
(because  of  tidal  friction  —  see  §8.4).  Estimate  the  height  of  the  tides 
when  the  Moon  was  10  Earth  radii  away.  How  far  away  will  it  be  when 
the  lunar  and  solar  tides  are  equal  in  magnitude? 

17.  Suppose  that  at  some  time  the  Moon  had  been  ocean-covered  and  ro- 
tating relative  to  the  Earth.  Find  the  ratio  between  the  heights  of  the 
tides  raised  on  the  Moon  by  the  Earth  and  on  the  Earth  by  the  Moon. 
(The  Moon's  radius  is  Rm  =  0.27i?E-)  Estimate  how  high  the  tides  on 
the  Moon  would  have  been  when  the  Earth-Moon  distance  was  10_Re- 

18.  Jupiter  has  a  satellite  Io  only  a  little  heavier  than  the  Moon  (mass 
8.93  x  1022kg,  as  compared  to  7.35  x  1022kg).  The  semi-major  axis  of 
its  orbit  is  4.22  x  105  km,  compared  with  3.84  x  105  km  for  the  Moon. 
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Estimate  the  height  of  the  tides  raised  on  Jupiter  by  Io.  (See  Problem 
14  for  Jovian  data.) 

19.  *The  tidal  bulge  in  the  ocean  may  be  regarded  as  adding  a  surface 
mass  distribution  of  the  same  form  as  the  surface  charge  distribution 
of  Problem  8,  positive  near  0  =  0  and  ty  and  negative  near  9  =  tt/2. 
Find  the  quadrupole  moment  of  this  distribution,  and  show  that  when 
it  is  included  in  (6.43)  the  effective  height  of  the  tides  is  increased  by 
a  factor  (1  —  3p0/5pE)_1 ,  where  pD  is  the  mean  density  of  the  oceans 
and  Pe  that  of  the  Earth  as  a  whole.  Evaluate  this  factor,  given  that 
pE  =  5.52  x  103kgm~3  and  pa  =  1.03  x  103kgm~3.  (Note,  however, 
that  the  effective  height  is  decreased  by  a  larger  factor  due  to  the  non- 
rigidity  of  the  Earth.) 

20.  *Two  small  identical  uniform  spheres  of  density  p  and  radius  r  are 
orbiting  the  Earth  in  a  circular  orbit  of  radius  a.  Given  that  the  spheres 
are  just  touching,  with  their  centres  in  line  with  the  Earth's  centre, 
and  that  the  only  force  between  them  is  gravitational,  show  that  they 
will  be  pulled  apart  by  the  Earth's  tidal  force  if  a  is  less  than  ac  = 
2(pe/p)1^3-Re,  where  ps  is  the  mean  density  of  the  Earth  and  Re  its 
radius.  (This  is  an  illustration  of  the  existence  of  the  Roche  limit,  within 
which  small  planetoids  would  be  torn  apart  by  tidal  forces.  The  actual 
limit  is  larger  than  the  one  found  here,  because  the  spheres  themselves 
would  be  distorted  by  the  tidal  force,  thus  enhancing  the  effect.  It  is 
ac  =  2.45(pe/p)1/3^e-  For  the  mean  density  of  the  Moon,  for  example, 
p  =  3.34  x  103kgm"3,  this  gives  ac  =  2.89RE.) 

21.  Verify  that  the  potential  of  a  uniformly  charged  sphere  satisfies  Pois- 
son's  equation,  both  inside  and  outside  the  sphere.  (You  will  need 
(A. 59)  for  this  and  subsequent  problems.) 

22.  The  potential  <j>(r)  =  (q/4neor)e~flr  may  be  regarded  as  representing 
the  effect  of  screening  of  a  charge  q  at  the  origin  by  mobile  charges  in 
a  plasma.  Calculate  the  charge  density  p  (at  points  where  r  /  0)  and 
find  the  total  charge  throughout  space,  excluding  the  origin. 

23.  By  considering  the  equilibrium  of  a  small  volume  element,  show  that  in 
a  fluid  in  equilibrium  under  pressure  and  gravitational  forces,  Vp  =  pg, 
where  p  is  the  density  and  p  the  pressure  (the  equation  of  hydrostatic 
equilibrium).  Deduce  that,  for  an  incompressible  fluid  of  uniform  den- 
sity p,  p  +  p$  is  a  constant.  Use  this  result  to  obtain  a  rough  estimate 
of  the  pressure  at  the  centre  of  the  Earth.  (Mean  density  of  Earth 
=  5.5  x  103  kgm-3.  Note  that  the  pressure  at  the  surface  is  essentially 
zero.  The  actual  pressure  at  the  centre  is  more  than  this  estimate  — 
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in  fact  about  3.6  x  1011  Pa  —  because  of  the  non-uniformity  of  p.  It 
should  be  noted  that  —  as  indicated  in  Problem  12  —  even  very  small 
departures  from  hydrostatic  equilibrium  would  result  in  collapse  on  a 
short  time  scale,  only  checked  by  an  increase  in  internal  pressure  via 
other  processes.) 

24.  *  Assume  that  the  pressure  p  in  a  star  with  spherical  symmetry  is  re- 
lated to  the  density  p  by  the  (distinctly  unrealistic)  equation  of  state 
p  =  \kp2,  where  A;  is  a  constant.  Use  the  fluid  equilibrium  equation 
obtained  in  Problem  23  to  find  a  relation  between  p  and  Hence  show 
that  Poisson's  equation  yields 

d2\rp(r)]         4ttG    ,  . 
__  =  __rp(r). 

Solve  this  equation  with  the  boundary  conditions  that  p  is  finite  at 
r  =  0  and  vanishes  at  the  surface  of  the  star.  Hence  show  that  the 
radius  a  of  the  star  is  determined  solely  by  k  and  is  independent  of  its 
mass  M.  Show  also  that  M  =  (4/7r)a3p(0). 

25.  *Show  that  the  moment  of  the  Earth's  gravitational  force  may  be  writ- 
ten in  the  form 

3GMma2J2  ., 
mr  A  g  =   g  (k  ■  r)(k  A  r). 

Consider  a  satellite  in  a  circular  orbit  of  radius  r  in  a  plane  inclined  to 
the  equator  at  an  angle  a.  By  introducing  a  pair  of  axes  in  the  plane 
of  the  orbit,  as  in  §5.6  (see  Fig.  5.10),  show  that  the  average  value  of 
this  moment  is 

,          .           SGMma  J2         ,,  . 
(mr  A  g)m  =  —  cos  a  (k  A  n), 

where  n  is  the  normal  to  the  orbital  plane.  Hence  show  that  the 
orbit  precesses  around  the  direction  k  of  the  Earth's  axis  at  a  rate 
ft  =  —  (3  J2C12 /2r2)u>  cos  a,  where  w  is  the  orbital  angular  velocity.  Eval- 
uate this  rate  for  an  orbit  400  km  above  the  Earth's  surface,  with  an 
inclination  of  30°.  Find  also  the  precessional  period. 

26.  *Show  that  the  moment  of  the  solar  tidal  force  mgs  on  an  Earth  satel- 
lite is 

3GMsm  . 
mrAgs  =   g  (r  ■  as)(r  A  as), 
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where  as  is  the  position  vector  of  the  Sun  relative  to  the  Earth.  Con- 
sider the  effect  on  the  Moon,  whose  orbit  is  inclined  at  a  =  5°  to  the 
ecliptic  (the  Earth's  orbital  plane).  Introduce  axes  i  and  j  in  the  plane 
of  the  ecliptic,  with  i  in  the  direction  where  the  Moon's  orbit  crosses  it. 
Write  down  the  positions  of  the  Sun  and  Moon,  relative  to  Earth,  as 
functions  of  time.  By  averaging  both  over  a  month  and  over  a  year  (i.  e., 
over  the  positions  in  their  orbits  of  both  bodies),  show  that  the  mo- 
ment leads  to  a  precession  of  the  Moon's  orbital  plane,  at  a  precessional 
angular  velocity 

il  =  - —  cosafe, 

where  w  is  the  Earth's  orbital  angular  velocity.  Compute  the  preces- 
sional period.  (Note  that  this  calculation  is  only  approximately  cor- 
rect.) 
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Chapter  7 

The  Two-Body  Problem 


We  shall  be  mainly  concerned  in  this  chapter  with  an  isolated  system  of 
two  particles,  subject  only  to  the  force  between  the  two.  However,  as  it  is 
no  harder  to  solve,  and  considerably  extends  the  range  of  applicability  of 
the  results,  we  shall  also  allow  the  presence  of  a  uniform  gravitational  field. 

7.1     Centre-of-mass  and  Relative  Co-ordinates 

We  denote  the  positions  and  masses  of  the  two  particles  by  r\,r2  and 
mi,  ?7i2.  If  the  force  on  the  first  particle  due  to  the  second  is  F,  then,  by 
Newton's  third  law,  that  on  the  second  due  to  the  first  is  —  F.  Thus,  in  a 
uniform  gravitational  field  g,  the  equations  of  motion  are 

miTi  =  mig  +  F, 
rn2r2  =  m2g  -  F. 

It  is  convenient  to  introduce  new  variables  in 
define  the  position  of  the  centre  of  mass, 

m1r1+m2r2 

H  =   i  ' 

mi  +  m2 

and  the  relative  position, 

v  =  T\  —  r2.  (7.3) 

(See  Fig.  7.1.)  The  centre  of  mass  is  often  denoted  by  f.  This  symbol  is, 
however,  typographically  inconvenient  when  dots  are  used  to  denote  time 
derivatives.  From  (7.1),  we  get  the  equation  of  motion  for  R  by  adding: 

MR  =  Mg,        where       M  =  m1  +  m2.  (7.4) 


place  of  ri  and  r2.  We 


(7.2) 
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Fig.  7.1 

One  virtue  of  the  notation  R  for  the  centre  of  mass  is  that  it  serves  to 
emphasize  the  fact  that  R  is  associated  with  the  total  mass,  M. 

The  equation  for  r  is  obtained  by  dividing  by  the  masses  and  then 
subtracting: 

ur  =  F,        where       u  =  .  7.5) 

The  mass  /j,  is  called  the  reduced  mass,  because  it  is  always  less  than  either 
mi  or  ra-2- 

These  two  equations  are  now  completely  separate.  Equation  (7.4)  shows 
that  the  centre  of  mass  moves  with  uniform  acceleration  g.  In  the  case  g  =  0 
it  is  equivalent  to  the  law  of  conservation  of  momentum, 

MR  =  miri  +  rri2ri  =  P  =  constant.  (7.6) 

The  equation  of  motion  (7.5)  for  the  relative  position  is  identical  with  that 
for  a  single  particle  of  mass  fi  moving  under  the  force  F.  If  we  can  solve 
this  one-particle  problem,  then  we  can  also  solve  the  two-particle  problem. 
When  we  have  found  R  and  r  as  functions  of  time,  we  can  obtain  the 
positions  of  the  particles  by  solving  the  simultaneous  equation  (7.2)  and 
(7.3): 

n=R+—  r,       r2=R-—r.  (7.7) 

The  separation  between  centre-of-mass  and  relative  motion  extends  to 
the  expressions  for  the  total  angular  momentum  and  kinetic  energy.  We 
have 

J  =  m\r\  A  r\  +  m-2Xi  A 

=  mi(R+  ^r)  A  (it  +  ^f)  +  m2  (R  -^r)A(k-^r), 
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by  (7.7).  It  is  easy  to  see  that  the  cross  terms  between  R  and  r  cancel,  and 
we  are  left  with 

J  =  MRAR  +  fir  Ar.  (7.8) 

Similarly  substituting  (7.7)  into 

T  =  \m\r\  +  \m2r\, 

we  find,  after  a  little  algebra, 

T  =  \MR2  +  liir2.  (7.9) 

It  is  instructive  to  examine  this  problem  also  from  the  Lagrangian  point 
of  view  introduced  in  §3.7.  Let  us  suppose  that  the  internal,  two-body 
force  is  conservative,  and  depends  only  on  the  relative  co-ordinate  r.  It 
then  corresponds  to  a  potential  energy  function  Vint(r).  The  total  potential 
energy  V  includes  not  only  V;nt  but  also  the  potential  energy  of  the  external 
gravitational  forces.  Now  the  potential  energy  of  a  particle  of  mass  m  in  a 
uniform  gravitational  field  g  is  —mg  r.  (If  g  is  in  the  —  z-direction,  this  is 
the  familiar  expression  mgz.)  Thus  the  Lagrangian  function  is 

L  =  T  —  V 

=  \mir\  +  \m2r\  +  mxg  ■  n  +  rn2g  ■  r2  -  Vint(r1  -  r2).  (7.10) 

It  is  easy  to  verify  that  Lagrange's  equations  (3.44)  yield  the  correct  equa- 
tions of  motion  (7.1). 

Now,  just  as  in  §3.7,  we  may  express  L  in  terms  of  any  six  independent 
co-ordinates  in  place  of  x\,  yi,  z±,  x2,  y2,  z2.  In  particular,  we  may  choose 
the  coordinates  of  R  and  r.  Then,  from  (7.9)  and  (7.10),  we  find 

L=\MR2  +  Mg-R+\nr2  -Vint{r).  (7.11) 

The  complete  separation  of  the  terms  involving  R  from  those  involving  r 
is  now  obvious.  It  is  easy  to  check  that  Lagrange's  equations  for  these 
co-ordinates  are  just  (7.4)  and  (7.5). 

In  this  form,  it  is  clear  that  the  crucial  point  is  the  separation  of  the 
potential  energy.  The  kinetic  energy  may  always  be  separated  according  to 
(7.9),  but  only  for  a  uniform  gravitational  field  does  the  potential  energy 
separate  in  a  similar  way. 
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7.2    The  Centre-of-mass  Frame 

It  is  often  convenient  to  describe  the  motion  of  the  system  in  terms  of  a 
frame  of  reference  in  which  the  centre  of  mass  is  at  rest  at  the  origin.  (In  a 
gravitational  field,  this  is  an  accelerated,  non-inertial  frame,  but  it  is  still 
useful.)  This  is  the  centre-of-mass  (CM)  frame.  We  shall  denote  quantities 
referred  to  it  by  an  asterisk. 

The  relative  position  r  is  of  course  independent  of  the  choice  of  origin, 
so,  setting  R*  =  0  in  (7.7),  we  find 

r*-^r         r*--^r  (7  12) 

ri-  Mr,       r2-     M  r.  (7.12) 

In  this  frame,  the  momenta  of  the  two  particles  are  equal  and  opposite: 

m\r\  =  —m2r2  =  fir  =  p* ,  (7-13) 

say. 

As  we  shall  see  explicitly  later,  it  is  often  convenient  to  solve  a  problem 
first  in  the  CM  frame.  To  find  the  solution  in  some  other  frame,  we  then 
need  the  relations  between  the  momenta  in  the  two  frames.  Let  us  consider 
a  frame  in  which  the  centre  of  mass  is  moving  with  velocity  R.  Then  the 
velocities  of  the  two  particles  are 

r1  =  R  +  r*1,        r2=R  +  r2. 

Hence  by  (7.13),  their  momenta  are 

Pj  =  miri  =  m\R  +  p*,       p2  =  rn2r2  =  m2R  —  p* .  (7-14) 

From  (7.8)  and  (7.9),  it  follows  that  the  total  momentum,  angular  momen- 
tum and  kinetic  energy  in  the  CM  frame  are 

P*  =  0, 

J*  =  fir  A  r  =  r  A  p* ,  (7.15) 
r>*2 

T*  =  yr2  =5_. 

2P  2fi 
Thus  in  any  other  frame  we  can  write 

P  =  MR, 

J  =  MRAR+J*,  (7.16) 
T  =  \MR2  +T*. 
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To  obtain  the  values  in  any  frame  from  those  in  the  CM  frame,  we  have 
only  to  add  the  contribution  of  a  particle  of  mass  M  located  at  the  centre  of 
mass  R.  We  shall  see  in  the  next  chapter  that  this  is  a  general  conclusion, 
not  restricted  to  two-particle  systems. 

As  an  example,  let  us  consider  two  bodies  moving  under  their  mutual 
gravitational  attraction,  for  instance  the  Moon  and  the  Earth. 

Example:  Motion  of  a  satellite 

How  does  the  period  of  the  Moon's  orbit  relate  to  the  Earth- 
Moon  distance?  How  far  does  the  apparent  position  of  the  Sun 
oscillate  because  of  the  influence  of  the  Moon  on  the  Earth's 
position? 

Equation  (7.5)  becomes 

„  Gm\in2        „  GMji 


This  is  identical  with  the  equation  for  a  particle  moving  around  a  fixed 
point  mass  M.  In  particular,  for  an  elliptic  orbit,  the  period  is  given  by 
(4.31), 


Note  that  a  here  is  the  semi-major  axis  of  the  relative  orbit  (the  median 
distance  between  the  bodies),  and  that  M  is  the  sum  of  the  masses, 
rather  than  the  mass  of  the  heavier  body.  Thus  Kepler's  third  law  is 
only  approximately  correct:  the  orbital  period  depends  not  only  on  the 
semi-major  axis,  but  also  on  the  mass.  (See  also  Chapter  4.  Problem 
31.) 

The  only  cases  in  the  solar  system  for  which  the  lighter  mass  is  an 
appreciable  fraction  of  the  total  are  those  of  the  Earth-Moon  system, 
for  which  m\j =  1/81.3,  and  the  Pluto-Charon  system,  for  which 
mi/mo.  =  1/8.5.  If  we  were  to  compute  the  period  of  the  Moon's  orbit 
from  Kepler's  third  law  by  comparing  it  with  the  period  of  a  small  Earth 
satellite,  we  should  obtain  the  value  given  above  but  with  mi  in  place  of 
M.  This  would  yield  a  period  about  4  hours  too  long. 

In  the  CM  frame  of  the  Earth-Moon  system,  we  see  from  (7.12)  that 
both  bodies  move  in  ellipses  around  the  centre  of  mass,  with  semi-major 
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axes 


For  the  Earth-Moon  system,  a  =  3.84  x  105km.  The  Earth  therefore 
moves  around  the  centre  of  mass  in  a  small  ellipse  with  semi-major  axis 
02  =  a/82.3  =  4670  km.  (Note  that  the  centre  of  mass  is  inside  the 
Earth.)  This  leads  to  a  small  oscillation  in  the  direction  of  the  Sun  (see 
Fig.  7.2  —  which  is  not  drawn  to  scale),  of  angular  amplitude  a  ss  a^/A, 
where  A  is  the  distance  to  the  Sun;  since  A  =  1.5  x  108  km,  a  =  6.4 
seconds  of  arc. 


A  larger  effect  can  be  obtained  by  looking  at  the  direction  of  some 
object  closer  to  us  than  the  Sun,  such  as  an  asteroid.  This  effect  provides 
a  method  of  determining  the  lunar  mass. 

In  the  frame  in  which  the  Sun  is  at  rest,  the  centre  of  mass  of  the 
Earth-Moon  system  moves  in  an  ellipse  around  the  Sun,  and  the  motion 
of  the  two  bodies  around  the  centre  of  mass  is  superimposed  on  this  larger 
orbital  motion.  In  the  approximation  in  which  the  Sun's  gravitational  field 
is  uniform  over  the  dimension  of  the  Earth-Moon  system,  the  two  types 
of  motion  are  completely  separate,  and  no  transfer  of  energy  or  angular 
momentum  from  one  to  the  other  can  occur.  However,  as  we  shall  see  later, 
there  are  small  effects  due  to  the  non-uniformity  of  the  Sun's  field. 


Moon 


Sun 


Fig.  7.2 
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7.3    Elastic  Collisions 


A  collision  between  two  particles  is  called  elastic  if  there  is  no  loss  of  kinetic 
energy  in  the  collision  —  that  is,  if  the  total  kinetic  energy  after  the  collision 
is  the  same  as  that  before  it.  Such  collisions  are  typical  of  very  hard  bodies, 
like  billiard  balls.  (They  are  also  important  in  atomic  and  nuclear  collision 
problems.) 

It  is  easy  to  describe  such  a  collision  in  the  CM  frame.  The  particles 
must  approach  each  other  with  equal  and  opposite  momenta,  p*  and  —  p* , 
and  recede  after  the  collision  again  with  equal  and  opposite  momenta,  q* 
and  —q*.  Thus  each  particle  is  scattered  through  the  same  angle  9*  (see 
Fig.  7.3).  Since  the  collision  is  elastic,  we  have,  from  (7.15), 


Fig.  7.3 


„*2  *2 

T*  =  ^—  =  ■=—.  (7.17) 

Thus  the  magnitudes  of  the  momenta  before  and  after  the  collision  are  the 
same, 

V*  =  q*.  (7.18) 

In  practice,  most  experiments  are  performed  with  one  particle  initially 
at  rest  (or  nearly  so)  in  the  laboratory.  To  interpret  such  an  experiment,  we 
therefore  need  to  use  the  laboratory  {Lab)  frame,  in  which  the  momentum 
of  particle  2,  the  target,  before  the  collision  is  zero,  so  that  p2  =  0. 

We  shall  denote  the  Lab  momentum  of  the  incoming  particle  by  p1,  and 
the  momenta  after  the  collision  by  q1  and  q2,  and  the  angles  of  scatter- 
ing and  recoil  by  9  and  a  (see  Fig.  7.4).  We  could  work  out  the  relations 
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between  these  quantities  by  using  the  conservation  laws  of  energy  and  mo- 


Fig.  7.4 

mentum  directly  in  the  Lab  frame,  but  it  is  actually  simpler  to  relate  them 
to  the  CM  quantities,  using  the  formulae  obtained  in  §7.2. 
Since  p2  =  0,  we  have  from  (7.14), 


and  also 


Pi 


R  =  —p*, 

m2 


mi   ,  ,    .  M 
-p  +p  =  — p 


(7.19) 


(7.20) 


rn2  ni2 

The  momenta  after  the  collision  are  again  given  by  (7.14),  but  with  q*  in 
place  of  p*.  They  are  then,  using  also  (7.19), 


9i 


mi  ,  » 
 P  +Q 

m2 


q2=p 


All  these  relations  may  be  conveniently  summarized  in  a  vector  diagram, 
drawn  in  Fig.  7.5,  which  also  incorporates  the  momentum  conservation 


Fig.  7.5 
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equation  in  the  Lab,  p1  =  q1  +  q2-  Any  desired  relation  between  momenta, 
energies  or  angles  may  be  extracted  from  this  diagram  without  difficulty. 
We  consider  explicitly  only  a  few  important  relations. 

By  (7.18),  the  vectors  p*,q*,q2  form  an  isosceles  triangle.  Thus  the 
recoil  angle  and  the  magnitude  of  the  recoil  momentum  q2  are  given  in 
terms  of  CM  quantities  by 

a  =  i(ir -(?*),        q2  =  2j9*sin±<9*.  (7.21) 

The  Lab  kinetic  energy  transferred  to  the  target  particle  is  therefore 

T    —     g2     _  2P*2  „;„2  In* 

Li  —  —   sin  ~V  . 

2m2  m2 

On  the  other  hand,  the  total  kinetic  energy  in  the  Lab  is  just  the  kinetic 
energy  of  the  incoming  particle, 

T_  Pi  _  mV2 


2m\  2m\m\ 

by  (7.20).  The  interesting  quantity  is  the  fraction  of  the  total  kinetic  energy 
which  is  transferred.  This  is 

iT.  (7.22) 

The  maximum  possible  kinetic  energy  transfer  occurs  for  a  head-on  collision 
(9*  =  7r),  and  is  T2/T  =  4mi  7712/(7711  +  m^)1 .  Clearly  this  can  be  close  to 
unity  only  if  mi  and  ni2  are  comparable  in  magnitude.  If  the  incoming 
particle  is  very  light,  it  bounces  off  the  target  with  little  loss  of  energy; 
if  it  is  very  heavy,  it  is  hardly  deflected  at  all  from  its  original  trajectory, 
and  again  loses  little  of  its  energy.  For  example,  in  a  proton-a-particle 
collision  {m\/m2  =  4  or  |),  the  maximum  fractional  energy  transfer  is  64 
per  cent.  In  an  electron-proton  collision  (m\jm2  =  1/1836)  it  is  about  0.2 
per  cent.  (These  conclusions  are  changed  by  relativistic  effects  for  speeds 
approaching  that  of  light.) 

Another  important  relation  is  that  between  the  Lab  and  CM  scattering 
angles.  It  is  easy  to  prove  by  elementary  trigonometry  (most  simply  by 
dropping  a  perpendicular  from  the  upper  vertex  in  Fig.  7.5)  that 

sin  6* 

tantf  =  ,  -.  (7.23) 

(7711/7712)  +  cost'* 
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Note  that  this  relation  is  independent  of  the  momenta  of  the  two  particles, 
and  depends  only  on  their  mass  ratio. 

As  9*  varies  from  0  to  n,  the  vector  q*  sweeps  out  a  semicircle  of  radius 
p*.  If  the  target  is  the  heavier  particle  (mi/rri2  <  1),  the  left-hand  vertex  of 
Fig.  7.5  lies  inside  this  semicircle,  and  9  also  varies  from  0  to  ir.  However,  if 
the  target  is  the  lighter  particle  (mi/rri2  >  1),  the  semicircle  excludes  this 
vertex.  In  that  case,  6  =  0  both  when  6*  =  0  and  when  9*  =  it,  and  there 
is  an  intermediate  value  of  9*  for  which  9  is  a  maximum.  The  maximum 
scattering  angle  occurs  when  q1  is  tangent  to  the  circle  (see  Fig.  7.6).  It  is 
given  by 


Fig.  7.6 

sin0max  =  m2/mi.  (7.24) 

For  instance,  an  a-particle  can  only  be  scattered  by  a  proton  through  an 
angle  of  almost  14.5°,  and  a  proton  can  only  be  scattered  by  an  electron 
through  an  angle  less  than  0.031°. 

In  the  special  case  of  equal-mass  particles,  the  right  side  of  (7.23)  sim- 
plifies to  tan        so  that  we  obtain  the  simple  relation 

9=\9*,       (m1=m2).  (7.25) 

In  this  case,  the  maximum  scattering  angle  is  ir/2. 

7.4    CM  and  Lab  Cross-sections 

Let  us  now  consider  the  scattering  of  a  beam  of  particles  by  a  target.  We 
shall  suppose  that  the  target  contains  a  large  number  N  of  identical  target 
particles,  but,  as  in  §4.5,  is  still  small  enough  so  that  the  particles  do  not 
screen  each  other,  and  so  that  the  geometry  of  the  scattering  event  is  not 
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seriously  affected  by  which  target  particle  is  struck.  We  also  suppose  that 
the  beam  is  a  parallel  beam  of  particles,  each  with  the  same  mass  and 
velocity,  with  a  particle  flux  of  /  particles  crossing  unit  area  per  unit  time. 

The  incident  particles  may  or  may  not  be  the  same  as  those  in  the 
target,  but  for  the  moment  we  shall  assume  that  they  can  be  distinguished, 
so  that  it  is  possible  to  set  up  a  detector  to  count  the  number  of  scattered 
particles  emerging  in  some  direction,  without  including  also  the  recoiling 
target  particles. 

We  can  now  introduce  the  concept  of  differential  cross-section,  just  as  we 
did  for  the  case  of  a  fixed  target  in  §4.5.  Indeed  the  fact  that  the  particles 
that  are  struck  recoil  out  of  the  target  makes  no  essential  difference.  In 
any  one  collision,  the  scattering  angle  9  will  be  determined  in  some  way 
(depending  on  the  shape  of  the  particles)  by  the  impact  parameter  b.  Thus 
the  particles  scattered  through  angles  between  9  and  9  +  d9  will  be  those 
of  the  incident  particles  which  strike  any  one  of  the  target  particles  with 
impact  parameters  between  the  corresponding  values  b  and  b  +  db.  To 
find  the  number  emerging  within  a  solid  angle  in  some  specified  direction, 
we  have  to  calculate  the  corresponding  cross-sectional  area  of  the  incident 
beam, 

da  =  b \db\  dip.  (7.26) 

(compare  (4.39)  and  Fig.  4.9),  and  multiply  by  the  number  of  target  parti- 
cles, N,  and  by  the  flux,  /. 

If  we  set  up  a  detector  of  cross-sectional  area  dA  at  a  large  distance  L 
from  the  target,  the  rate  of  detection  will  be 

=  (7.27) 

exactly  as  in  (4.44).  The  ratio  da/dfl  is  the  Lab  differential  cross-section. 

We  shall  return  to  the  question  of  calculating  the  Lab  differential  cross- 
section  later.  First,  however,  we  wish  to  discuss  a  slightly  different  type  of 
experiment. 

Let  us  imagine  two  beams  of  particles  approaching  from  opposite  direc- 
tions —  a  'colliding-beam'  experiment.  In  particular,  we  shall  be  interested 
in  the  case  where  the  momenta  of  the  particles  in  the  two  beams  are  equal 
and  opposite,  so  that  we  are  directly  concerned  with  the  CM  frame.  To  be 
specific,  let  us  suppose  that  the  particles  in  one  beam  are  hard  spheres  of 
radius  a\  and  those  in  the  other  beam  are  hard  spheres  of  radius  an-  Ev- 
idently, a  particular  pair  of  particles  will  collide  if  the  distance  b  between 
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the  lines  of  motion  of  their  centres  is  less  than  a 


=  ai  +  a2  (see  Fig.  7.7). 


Fig.  7.7 

Let  us  select  one  of  the  particles  in  the  second  beam.  It  will  collide  with 
any  particles  in  the  first  beam  whose  centres  cross  an  area  a  =  na2.  This  is 
the  total  cross-section  for  the  collision,  and  is  the  same  whether  the  target 
is  at  rest  or  moving  towards  the  beam.  Note  that  it  is  the  sum  of  the  radii 
which  enters  here.  This  serves  to  emphasize  the  fact  that  the  cross-section 
is  a  property  of  the  pair  of  particles  involved  in  the  collision,  not  of  either 
particle  individually. 

To  compute  the  number  of  collisions  occurring  in  a  short  time  interval 
dt,  we  may  imagine  the  cross-sectional  area  a  to  be  attached  to  our  selected 
particle  and  moving  with  it.  Then  the  probability  that  the  centre  of  one 
of  the  particles  in  the  first  beam  crosses  this  area  in  the  time  dt  will  be 
riivadt,  where  n\  is  the  number  of  particles  per  unit  volume  in  the  beam, 
and  v  is  the  relative  velocity  (=  v\  +  U2).  If  the  number  of  particles  in 
the  second  beam  per  unit  volume  is  712,  then  the  total  number  of  collisions 
occurring  in  volume  V  is  mnivaV  dt. 

Now,  if  we  are  interested  in  the  number  of  particles  emerging  in  a  given 
solid  angle,  we  have  to  find  the  relation  between  b  and  the  CM  scattering 
angle  9*,  and  evaluate  the  corresponding  cross-sectional  area  (7.26).  In  our 
case,  it  is  clear  from  Fig.  7.7  that 


b  =  a  sin  a  =  a  cos  \Q* ■ 


(7.28) 
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Hence,  introducing  the  solid  angle 

dfi*  =  sin6*  d6*  d(p, 
as  in  (4.42),  we  find  that  the  CM  differential  cross-section  is 

\a2.  (7.29) 


da  _  1  2 


dn* 

If  we  set  up  a  detector  arranged  to  record  particles  scattered  in  a  particular 
direction  from  a  small  volume  V,  then  the  detection  rate  of  particles  will 
be 

da  dA  .  . 

w  =  nin2vV—jj.  (7.30) 

Note  that  N  and  /  in  (7.27)  correspond  to  n2V  and  n\V  in  this  formula. 

In  our  particular  case,  of  hard-sphere  scattering,  the  differential  cross- 
section  (7.29)  is  independent  of  9*,  and  the  scattering  is  therefore  isotropic 
in  the  CM  frame.  In  other  words,  the  number  of  particles  detected  is 
independent  of  the  direction  in  which  we  place  the  detector  (so  long  as  it 
is  outside  the  unscattered  beam).  This  is  a  very  special  property  of  hard 
spheres,  and  is  not  true  in  general.  Nor  is  it  true  in  the  Lab  frame.  Indeed, 
the  reason  for  calculating  the  CM  differential  cross-section  first  is  that  the 
relation  between  b  and  9*  is  much  simpler  than  that  between  b  and  9,  so 
that  the  cross-section  takes  a  simpler  form  in  the  CM  frame  than  in  the 
Lab  frame. 

The  easiest  way  to  find  the  Lab  differential  cross-section  is  often  to 
compute  the  CM  differential  cross-section  first,  and  then  use  the  relation 

dfl*      sm9*d9*      dz*  .  ,  n 

"To"  =  — '■ — ZT7ZT  =  ~~a — '        wlth       2  =  cos6>,     z  =cos6>. 
dS2       smOaff  dz 

Let  us  assume  (for  a  reason  to  be  given  later)  that  m\  <  ni2-  Then  it 
follows  that 

Ckr=^d£* 

dn    dn*  dz  '  y  '  ' 

The  value  of  dz*/dz  may  be  found  from  (7.23),  which  may  be  written 

2  _   [(mi/m2)  +  z*]2  

1  +  2(7711/7712)2*  +  {jnxjm-i)1 

It  is  straightforward,  if  tedious,  to  solve  for  z*  and  differentiate,  but  the 
resulting  expression  for  the  general  case  is  not  particularly  illuminating, 
and  we  shall  omit  it. 
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If  mi  >  rri2,  there  is  a  complication  which  arises  from  the  fact  that  for 
each  value  of  9  less  than  the  maximum  scattering  angle  #max  there  are  two 
possible  values  for  9* ,  as  is  clear  from  Fig.  7.6.  If  we  measure  only  the 
direction  of  the  scattered  particles,  we  cannot  distinguish  these  two,  and 
must  add  the  contributions  to  dcr/dO  from  da/dQ.*  at  these  two  separate 
values  of  6* .  (Moreover,  for  one  of  these  two,  dz*/dz  is  negative,  and  must 
be  replaced  by  its  absolute  value.)  The  two  values  of  9*  can  be  distinguished 
if  we  can  measure  also  the  energy  or  momentum  of  the  scattered  particles. 

We  shall  consider  explicitly  only  the  case  of  equal  mass  particles,  m\  = 

Example:  Scattering  of  equal  hard  spheres 

Find  the  Lab-frame  differential  cross-section  for  the  scattering 
of  equal-mass  hard  spheres. 

Here  the  relation  between  the  scattering  angles,  (7.23),  simplifies  to 
9*  =  29,  or  z*  =  2z2  —  1.  Consequently,  dz*/dz  =  4z,  and  the  Lab 
differential  cross-section  corresponding  to  (7.29)  is 


^  =  a2  cos 9,        (6><±tt).  (7.32) 


0,       (6»  >  ±tt).  (7.33) 


Of  course,  no  particles  are  scattered  through  angles  greater  than  w/2,  so 
da 

dn 

In  this  case,  the  cross-section  is  peaked  towards  the  forward  direction, 
and  more  particles  will  enter  a  detector  placed  at  a  small  angle  to  the 
beam  direction  than  one  placed  nearly  at  right  angles. 

As  a  check  of  this  result,  we  may  evaluate  the  total  cross-section  a, 
which  must  agree  with  the  value  a  =  An  x  ja2  =  Tia2  obtained  in  the 
CM  frame.  We  find 

f  f  d(T  /*27T  /17r/2 

o  —  II  ——  dfl  =  /     dip         a2  cos  #  sin  #  d#  =  7ra2, 

JJ  dn       j0  j0 

as  expected.  (Note  the  upper  limit  of  tv/2  on  the  9  integration,  which 
arises  because  of  (7.33).) 
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We  assumed  at  the  beginning  of  this  section  that  the  target  particles  and 
beam  particles  were  distinguishable,  and  that  the  aim  of  the  experiment 
was  to  count  scattered  beam  particles  only.  However,  it  is  easy  to  relax 
this  condition.  We  shall  now  calculate  the  rate  at  which  recoiling  target 
particles  enter  the  detector.  In  a  collision  in  which  the  incoming  particle 
has  impact  parameter  b,  and  is  moving  in  a  plane  specified  by  the  angle  p, 
the  target  particle  will  emerge  in  a  direction  specified  by  the  polar  angles 
a,  ip,  where  a  is  related  to  0*  or  b  by  (7.21)  and  ip  =  it  +  ip.  The  number  of 
target  particles  emerging  within  the  angular  range  da,  dip  is  therefore  equal 
to  the  number  of  incident  particles  in  the  corresponding  range  d&,  dtp.  To 
determine  the  number  of  recoiling  particles  entering  the  detector,  we  have 
to  relate  da  to  the  solid  angle 

dSl2  =  sin  a  da  dip- 
By  (7.21),  we  have  cos  8*  =  1  —  2  cos2  a,  whence 

d(J  do  /  1      \  ,rrr,A\ 

d^  =  dfF4cosa     {a<^-  (7'34) 

In  the  particular  case  of  equal-mass  hard  spheres,  this  differential  cross- 
section  has  exactly  the  same  form  as  (7.32).  This  shows  that  in  the  scatter- 
ing of  identical  hard  spheres  the  numbers  of  scattered  particles  and  recoiling 
target  particles  entering  the  detector  are  precisely  equal,  and  the  total  de- 
tection rate  is  obtained  by  doubling  that  found  earlier. 


7.5  Summary 

For  our  later  work,  the  most  important  result  of  this  chapter  is  that  the  total 
momentum,  angular  momentum  and  kinetic  energy  in  an  arbitrary  frame 
differ  from  those  in  the  CM  frame  by  an  amount  equal  to  the  contribution 
of  a  particle  of  mass  M  moving  with  the  centre  of  mass. 

We  have  seen  that  for  a  two-particle  system  the  use  of  the  CM  frame 
is  often  a  considerable  simplification,  and  this  is  true  also  for  more  com- 
plicated systems.  When  we  need  results  in  some  other  frame,  it  is  often 
best  to  solve  the  problem  first  in  the  CM  frame,  and  then  transform  to  the 
required  frame.  This  is  true  for  example  in  the  calculation  of  scattering 
cross-sections  in  two-particle  collisions. 
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Problems 

1.  A  double  star  is  formed  of  two  components,  each  with  mass  equal  to 
that  of  the  Sun.  The  distance  between  them  is  1  AU  (see  Chapter  4, 
Problem  2).  What  is  the  orbital  period? 

2.  Where  is  the  centre  of  mass  of  the  Sun-Jupiter  system?  (The  mass 
ratio  is  Ms/Mj  =  1047.  See  Chapter  4,  Problems  2  and  3.)  Through 
what  angle  does  the  Sun's  position  as  seen  from  the  Earth  oscillate 
because  of  the  gravitational  attraction  of  Jupiter? 

3.  The  parallax  of  a  star  (the  angle  subtended  at  the  star  by  the  radius  of 
the  Earth's  orbit)  is  w.  The  star's  position  is  observed  to  oscillate  with 
angular  amplitude  a  and  period  r.  If  the  oscillation  is  interpreted  as 
being  due  to  the  existence  of  a  planet  moving  in  a  circular  orbit  around 
the  star,  show  that  its  mass  vti\  is  given  by 

mi__a  /Mte\2/3 
Ms  ~  vj  \  Mst  J 

where  M  is  the  total  mass  of  star  plus  planet,  Ms  is  the  Sun's  mass, 
and  te  =  lyear.  Evaluate  the  mass  m\  if  M  =  0.25Ms,  r  =  16  years, 
zd  =  0.5"  and  a  =  0.01".  What  conclusion  can  be  drawn  without 
making  the  assumption  that  the  orbit  is  circular? 

4.  Two  particles  of  masses  rri\  and  m.2  are  attached  to  the  ends  of  a  light 
spring.  The  natural  length  of  the  spring  is  I,  and  its  tension  is  k  times 
its  extension.  Initially,  the  particles  are  at  rest,  with  nil  at  a  height  I 
above  mi-  At  t  =  0,  mi  is  projected  vertically  upward  with  velocity  v. 
Find  the  positions  of  the  particles  at  any  subsequent  time  (assuming 
that  v  is  not  so  large  that  the  spring  is  expanded  or  compressed  beyond 
its  elastic  limit). 

5.  *Prove  that  in  an  elastic  scattering  process  the  angle  0  +  a  between  the 
emerging  particles  is  related  to  the  recoil  angle  a  by 

tan(#  +  a)      ni\  +  mi 
tana         m\  —  ni2 

[Hint:  Express  both  tangents  in  terms  of  tan  \0* ■)  What  is  the  mass 
ratio  if  the  particles  emerge  at  right  angles  to  each  other? 

6.  A  proton  is  elastically  scattered  through  an  angle  of  56°  by  a  nucleus, 
which  recoils  at  an  angle  a  =  60°.  Find  the  atomic  mass  of  the  nucleus, 
and  the  fraction  of  the  kinetic  energy  transferred  to  it. 
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7.  An  experiment  is  to  be  designed  to  measure  the  differential  cross-section 
for  elastic  pion-proton  scattering  at  a  CM  scattering  angle  of  70°  and  a 
pion  CM  kinetic  energy  of  490  keV.  (The  electron-volt  (eV)  is  the  atomic 
unit  of  energy.)  Find  the  angles  in  the  Lab  at  which  the  scattered 
pions,  and  the  recoiling  protons,  should  be  detected,  and  the  required 
Lab  kinetic  energy  of  the  pion  beam.  (The  ratio  of  pion  to  proton  mass 
is  1/7.) 

8.  *An  unstable  particle  of  mass  M  =  vri\  +  decays  into  two  particles 
of  masses  rri\  and  mi-,  releasing  an  amount  of  energy  Q.  Determine 
the  kinetic  energies  of  the  two  particles  in  the  CM  frame.  Given  that 
mi/m.2  =  4,  Q  =  lMeV,  and  that  the  unstable  particle  is  moving  in 
the  Lab  with  kinetic  energy  2.25  MeV,  find  the  maximum  and  minimum 
Lab  kinetic  energies  of  the  particle  of  mass  mi. 

9.  The  molecules  in  a  gas  may  be  treated  as  identical  hard  spheres.  Find 
the  average  loss  of  kinetic  energy  of  a  molecule  with  kinetic  energy  T 
in  a  collision  with  a  stationary  molecule.  (Hint:  Use  the  fact  that  the 
collisions  are  isotropic  in  the  CM  frame,  so  that  all  values  of  cos#* 
between  ±1  are  equally  probable.)  How  many  collisions  are  required, 
on  average,  to  reduce  the  velocity  of  an  exceptionally  fast  molecule  by 
a  factor  of  1000? 

10.  Two  identical  charged  particles,  each  of  mass  m  and  charge  e,  are  ini- 
tially far  apart.  One  of  the  particles  is  at  rest  at  the  origin,  and  the 
other  is  approaching  it  with  velocity  v  along  the  line  x  =  b,  y  =  0, 
where  b  =  e1  /2neomv2.  Find  the  scattering  angle  in  the  CM  frame, 
and  the  directions  in  which  the  two  particles  emerge  in  the  Lab.  (See 
§4.7.) 

11.  *Find  the  distance  of  closest  approach  for  the  particles  in  Problem  10, 
and  the  velocity  of  each  at  the  moment  of  closest  approach. 

12.  Obtain  the  relation  between  the  total  kinetic  energy  in  the  CM  and 
Lab  frames.  Discuss  the  limiting  cases  of  very  large  and  very  small 
mass  for  the  target. 

13.  ^Suppose  that  the  asteroid  of  Chapter  4,  Problem  17,  has  a  mass  of 
6  x  1020  kg.  Find  the  proportional  change  in  the  kinetic  energy  of  the 
Earth  in  this  encounter.  What  is  the  resulting  change  in  the  semi-major 
axis  of  the  Earth's  orbit?  By  how  much  is  its  orbital  period  lengthened? 
(Note  that  the  postulated  event  is  exceedingly  improbable.) 

14.  Calculate  the  differential  cross-section  for  the  scattering  of  identical 
hard  spheres  directly  in  the  Lab  frame. 


176 


Classical  Mechanics 


15.  Find  the  Lab  differential  cross-section  for  the  scattering  of  identical 
particles  of  charge  e  and  mass  to,  if  the  incident  velocity  is  v.  (See 
(4.50).) 

16.  At  low  energies,  protons  and  neutrons  behave  roughly  like  hard  spheres 
of  equal  mass  and  radius  about  1.3  x  10_14m.  A  parallel  beam  of 
neutrons,  with  a  flux  of  3  x  1010  neutrons  m_2s_1,  strikes  a  target 
containing  4  x  1022  protons.  A  circular  detector  of  radius  20  mm  is 
placed  0.7m  from  the  target,  in  a  direction  making  an  angle  of  30°  to 
the  beam  direction.  Calculate  the  rate  of  detection  of  neutrons,  and  of 
protons. 

17.  *Write  down  the  equations  of  motion  for  a  pair  of  charged  particles  of 
equal  masses  to,  and  of  charges  q  and  —q,  in  a  uniform  electric  field  E. 
Show  that  the  field  does  not  affect  the  motion  of  the  centre  of  mass. 
Suppose  that  the  particles  are  moving  in  circular  orbits  with  angular 
velocity  uj  in  planes  parallel  to  the  xy-plane,  with  E  in  the  2-direction. 
Write  the  equations  in  a  frame  rotating  with  angular  velocity  uj,  and 
hence  find  the  separation  of  the  planes. 


Chapter  8 

Many-Body  Systems 


Any  material  object  may  be  regarded  as  composed  of  a  large  number  of 
small  particles,  small  enough  to  be  treated  as  essentially  point-like,  but  still 
large  enough  to  obey  the  laws  of  classical  rather  than  quantum  mechanics. 
These  particles  interact  in  complicated  ways  with  each  other  and  with  the 
environment.  However,  as  we  shall  see,  if  we  are  interested  only  in  the 
motion  of  the  object  as  a  whole,  many  of  these  details  are  irrelevant. 

We  shall  consider  in  this  chapter  a  general  system  of  ./V  particles  labelled 
by  an  index  i  =  1,  2, . . . ,  N,  interacting  through  two-body  forces,  and  also 
subjected  to  external  forces  due  to  bodies  outside  the  system.  We  denote 
the  force  on  the  ith  particle  due  to  the  jth  particle  by  Fy ,  and  the  external 
force  on  the  ith  particle  by  Fj.  Then  the  equations  of  motion  are 


Here,  and  throughout  this  chapter,  the  sum  is  over  all  particles  of  the 
system,  j  =  1, 2, . . . ,  TV.  Of  course,  there  is  no  force  on  the  ith  particle  due 
to  itself,  and  so  Fa  =  0.  The  sum  in  this  case  is  really  over  the  other  N—l 
particles. 


8.1    Momentum;  Centre-of-mass  Motion 

The  position  R  of  the  centre  of  mass  is  defined,  as  in  (7.2),  by 


mif-i  =  Fn  +  Fi2  +  ■  ■  ■  +  FiN  +  Fi  =  Y,  Fij  +  *V 


(8.1) 


3 


where 


(8.2) 
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The  total  momentum  is 

p  =  E  mif  *  =  (8-3) 

i 

It  is  equal  to  the  momentum  of  a  particle  of  mass  M  located  at  the  centre 
of  mass. 

The  rate  of  change  of  momentum,  by  (8.1),  is 

p  =  EE^+EF*-  (8-4) 

i       j  i 

Now  the  two-body  forces  must  satisfy  Newton's  third  law 

F0l  =  —Fij.  (8.5) 

Thus,  for  every  term  in  the  double  sum  in  (8.4),  there  is  an  equal  and 
opposite  term  Fji .  The  terms  therefore  cancel  in  pairs  and  the  double  sum 
is  zero.  (Compare  the  discussion  at  the  end  of  §1.3  for  the  three-particle 
case.) 

Hence  we  obtain  the  important  result  that  the  rate  of  change  of  mo- 
mentum is  equal  to  the  sum  of  the  external  forces  alone: 

P  =  MR  =  Y,Fl.  (8.6) 

i 

In  the  special  case  of  an  isolated  system  of  particles,  acted  on  by  no  external 
forces,  this  yields  the  law  of  conservation  of  momentum, 

P  =  MR  =  constant.  (8.7) 

Let  us  now  regard  our  system  of  particles  as  forming  a  composite  body. 
If  the  body  is  isolated,  then  according  to  Newton's  first  law  it  moves  with 
uniform  velocity.  Thus  we  see  that  to  maintain  this  law  for  composite 
bodies,  we  should  define  the  position  of  such  a  body  to  mean  the  position 
of  its  centre  of  mass.  Moreover,  with  this  definition,  Newton's  second  law  is 
just  (8.6),  provided  that  we  interpret  the  force  on  the  body  in  the  obvious 
way  to  mean  the  sum  of  the  external  forces  on  all  of  its  constituent  particles, 
and  the  mass  as  the  sum  of  their  masses.  It  is  also  clear  that  if  Newton's 
third  law  applies  to  each  pair  of  particles  from  two  composite  bodies,  then 
it  will  apply  to  the  bodies  as  a  whole. 

Thus,  with  a  suitable  (and  very  natural)  interpretation  of  the  concepts 
involved,  Newton's  three  basic  laws  may  be  applied  to  composite  bodies  as 
well  as  to  point  particles.  (Though  we  have  phrased  the  discussion  in  terms 
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of  collections  of  particles,  we  could  also  include  the  case  of  a  continuous 
distribution  of  matter,  by  dividing  it  up  into  infinitesimal  particles.)  It 
follows  that,  so  long  as  we  are  interested  only  in  the  motion  of  a  body  as  a 
whole,  we  may  replace  it  by  a  particle  of  mass  M  located  at  the  centre  of 
mass.  This  is  a  result  of  the  greatest  importance,  for  it  allows  us  to  apply 
our  earlier  discussion  of  particle  motion  to  real  physical  bodies.  We  have  of 
course  implicitly  assumed  it  in  many  of  our  applications;  for  example,  we 
treated  the  planets  as  point  particles  in  discussing  their  orbital  motion. 

There  is  one  point  about  which  one  must  be  careful.  We  may  still  need 
some  information  about  the  actual  shape  of  the  body  to  calculate  the  total 
force  acting  on  it.  This  force  is  not  necessarily  equal  to  that  on  a  particle 
of  mass  M  at  the  centre  of  mass.  In  the  gravitational  case,  it  happens  to 
be  so  if  the  body  is  spherical,  or  if  the  gravitational  field  is  uniform,  but 
not  in  general  otherwise. 

As  an  example  of  the  use  of  the  momentum  conservation  equation  for 
an  isolated  system,  we  consider  the  motion  of  a  rocket. 

Example:  Rockets 

A  rocket,  initially  at  rest,  emits  matter  with  a  constant  velocity 
u  relative  to  the  rocket  (though  not  necessarily  at  a  uniform 
rate).  Gravity  is  negligible.  If  the  initial  total  mass  of  the  rocket 
is  Mo,  find  its  velocity  v  when  the  mass  has  fallen  to  M. 

Consider  the  emission  of  a  small  mass  dm.  After  the  emission,  the 
mass  of  the  rocket  will  be  reduced  by  this  amount, 


and  its  velocity  will  be  increased  to  some  value  v  +  dv.  (See  Fig.  8.1.) 
This  velocity  can  be  found  from  the  momentum  conservation  equation 

(M  -  dm)(v  +  dv)  +  dm(v  -  u)  =  Mv. 


dM  =  -dm, 


before 


after 


Fig.  8.1 
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Neglecting  second-order  infinitesimals,  we  find 


M  dv  =  udm. 


(8.9) 


By  (8.8),  we  can  write  this  in  the  form 

dv  _  dM 

whence,  integrating,  and  using  the  initial  condition  M  =  Mo  when  v  =  0, 
we  have 


This  relation  may  be  solved  to  give  the  mass  as  a  function  of  the 
velocity  attained: 


This  shows  that  to  accelerate  a  rocket  to  a  velocity  equal  to  its  ejection 
velocity  u,  we  must  eject  all  but  a  fraction  1/e  of  its  original  mass. 

Note  that  the  velocity  of  the  rocket  depends  only  on  the  ejection  velocity 
and  the  fraction  of  the  initial  mass  which  has  been  ejected,  and  not  on  the 
rate  of  ejection.  It  makes  no  difference  whether  the  acceleration  is  brief  and 
intense,  or  prolonged  and  gentle.  (This  assumes  of  course  that  the  rocket 
is  not  subjected  to  other  forces,  such  as  gravity,  during  its  acceleration. 
It  would  clearly  be  useless  to  try  to  escape  from  the  Earth  with  a  rocket 
providing  a  long,  slow  acceleration,  because  it  would  be  constantly  retarded 
by  gravity.) 

For  interplanetary  flights,  rockets  are  normally  used  only  for  brief  spells, 
between  which  the  spacecraft  moves  in  a  free  orbit  —  as,  for  example,  in 
Chapter  4,  Problem  16,  where  these  brief  spells  are  at  or  near  planetary 
departures  and  arrivals.  If  the  duration  of  the  rocket  bursts  is  sufficiently 
short  for  the  change  in  position  of  the  rocket  during  each  burst  to  be  neg- 
ligible, then  it  is  a  good  approximation  to  assume  that  on  each  occasion 
the  velocity  is  changed  instantaneously,  by  an  amount  known  as  the  veloc- 
ity impulse.  The  relevant  quantity  for  determining  the  mass  of  the  rocket 
required  to  deliver  a  given  payload,  using  a  given  ejection  velocity,  via  a 
given  orbital  manoeuvre,  is  then  the  sum  of  the  velocity  impulses  (in  the 
ordinary,  not  the  vector,  sense).  For  example,  the  minimum  velocity  im- 
pulse needed  to  escape  from  the  Earth  is  llkms-1.  For  the  return  trip, 


V  -  \nM 

u  Mq 


M  =  M0e 


-v/u 


(8.10) 
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it  is  22kms  1,  if  the  deceleration  on  re-entry  into  the  atmosphere  is  to  be 
produced  by  the  rockets  rather  than  atmospheric  friction. 

8.2    Angular  Momentum;  Central  Internal  Forces 

The  total  angular  momentum  of  our  system  of  particles  is 

J  =  ^mjr,AT',.  (8.11) 

i 

The  rate  of  change  of  J  is 

i  i       j  i 

Now  let  us  examine  the  contribution  to  (8.12)  from  the  internal  force  be- 
tween a  particular  pair  of  particles,  say  1  and  2.  (See  Fig.  8.2.)  For  sim- 
plicity, let  us  write  r  =  r\  —  r2,  and  F  =  F12,  so  that  also  F21  =  —F. 


Fig.  8.2 

The  contribution  consists  of  two  terms, 

ri  A  F12  +  r2  A  F2i  =  r1  A  F  -  r2  A  F  =  r  A  F.  (8.13) 

This  contribution  will  be  zero  if  F  is  a  central  internal  force,  parallel  to  r. 

Let  us  for  the  moment  simply  assume  that  all  the  internal  forces  are 
central.  (We  shall  discuss  the  possible  justification  for  this  assumption 
shortly.)  Then  all  the  terms  in  the  double  sum  in  (8.12)  will  cancel  in 
pairs,  just  as  they  did  in  the  evaluation  of  the  rate  of  change  of  total 
linear  momentum.    The  total  moment  of  all  the  internal  forces  will  be 
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zero.  So,  when  the  internal  forces  are  central,  the  rate  of  change  of  angular 
momentum  is  equal  to  the  sum  of  the  moments  of  the  external  forces  only: 

j^c.AF,.  (8.14) 

i 

In  particular,  for  an  isolated  system,  with  all  Ft  =  0,  we  have  the  law 
of  conservation  of  angular  momentum, 

J  =  constant.  (8.15) 

More  generally,  this  is  true  if  all  the  external  forces  are  directed  towards, 
or  away  from,  the  origin  (which  may  be  true  for  example  if  they  are  due  to 
the  gravitational  attraction  of  a  spherical  body) . 

Although  we  have  only  shown  that  (8.14)  and  (8.15)  hold  in  the  special 
case  of  central  internal  forces,  they  are  actually  of  much  more  general  valid- 
ity, for  reasons  that  should  become  clearer  in  Chapter  12.  It  is  certainly  not 
true  that  all  the  internal  forces  in  real  solids,  liquids  or  gases  are  central. 
Many  are,  but  there  are  obvious  exceptions,  notably  the  electromagnetic 
force  between  moving  charges.  It  is  better  to  regard  (8.14)  as  a  basic  pos- 
tulate of  the  dynamics  of  composite  bodies,  justified  by  the  fact  that  the 
predictions  derived  from  it  agree  with  observations. 

It  is  often  convenient,  as  we  did  in  Chapter  7,  to  separate  the  contri- 
butions to  J  from  the  centre-of-mass  motion  and  the  relative  motion.  We 
define  the  positions  r*  of  the  particles  relative  to  the  centre  of  mass  by 

Ti=R  +  r*.  (8.16) 

Clearly,  the  position  of  the  centre  of  mass  relative  to  itself  is  zero,  so  that 

MR*  =^mir*  =0.  (8.17) 

i 

Now,  substituting  (8.16)  into  (8.11),  we  obtain 

j  =  ^2    R  A  R  +  (y!  nhr^j A  R + R  A  (y!  j 

i 

The  second  and  third  terms  vanish  by  virtue  of  (8.17).  Thus  we  can  write 

J  =  MRAR+  J*,  (8.18) 


Many-Body  Systems 


183 


where  J*  is  the  angular  momentum  about  the  centre  of  mass,  namely 

J*  r^r*  A  r*.  (8.19) 

i 

It  is  easy  to  find  the  rate  of  change  of  J*.  For,  by  (8.6), 

^-(MRAR)  =  MRAR  =  RAy  F>.  (8.20) 
at  ^-^ 

i 

Hence,  subtracting  from  (8.14),  and  using  (8.16),  we  obtain 

j*=5>*A.FV  (8.21) 

i 

Tims  the  rate  of  change  of  the  angular  momentum  J*  about  the  centre  of 
mass  is  equal  to  the  sum  of  the  moments  of  the  external  forces  about  the 
centre  of  mass.  This  is  a  remarkable  result.  For,  it  must  be  remembered 
that  the  centre  of  mass  is  not  in  general  moving  uniformly.  In  general,  we 
may  take  moments  about  the  origin  of  any  inertial  frame,  but  it  would 
be  quite  wrong  to  take  moments  about  an  accelerated  point.  Only  in  very 
special  cases,  such  as  where  the  point  is  the  centre  of  mass,  are  we  allowed 
to  do  this. 

This  result  means  that  in  discussing  the  rotational  motion  of  a  body 
we  can  ignore  the  motion  of  the  centre  of  mass,  and  treat  it  as  though  it 
were  fixed.  It  is  particularly  important  in  the  case  of  rigid  bodies,  which 
we  discuss  in  Chapter  9. 

For  an  isolated  system,  it  follows  from  (8.21)  that  J*  as  well  as  J  is 
a  constant.  More  generally,  J*  is  a  constant  if  the  external  forces  have 
zero  total  moment  about  the  centre  of  mass.  For  example,  for  a  system 
of  particles  in  a  uniform  gravitational  field,  the  resultant  force  acts  at  the 
centre  of  mass,  and  J*  is  therefore  a  constant. 

8.3    The  Earth-Moon  System 

As  an  interesting  example  of  the  use  of  the  angular  momentum  conservation 
law,  we  consider  the  system  comprising  the  Earth  and  the  Moon.  (We 
ignore  the  other  planets,  and  treat  the  Sun  as  fixed.) 
The  angular  momentum  of  this  system  is 


J  =  MR  A  R+  J*, 


(8.22) 
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where  M  =  m  +  m!  is  the  total  mass  of  Earth  plus  Moon,  and  J*  is  the 
angular  momentum  about  the  centre  of  mass.  This  angular  momentum 
J*  can  again  be  separated  into  an  orbital  term,  due  to  the  motion  of  the 
Earth  and  Moon  around  their  common  centre  of  mass,  and  two  rotational 
terms,  and  Jj!^,  due  to  the  rotation  of  each  body  about  its  own  centre. 
The  orbital  angular  momentum,  according  to  (7.15),  is  fir  A  f,  where  fi  = 
mm' /M.  Thus 


Let  us  first  neglect  the  non-uniformity  of  the  Sun's  gravitational  field 
over  the  dimensions  of  the  Earth-Moon  system.  Then  J*  as  well  as  J 
will  be  constant.  Indeed,  to  a  good  approximation,  the  individual  terms 
of  (8.23)  are  separately  constant.  However,  over  long  periods  of  time  they 
do  change,  for  at  least  two  reasons.  One  is  the  non-spherical  shape  of  the 
Earth,  which,  as  we  saw  in  §6.5,  leads  to  a  precession  of  the  Moon's  orbital 
plane.  This  is  too  small  to  be  of  much  importance  in  the  present  context 
(the  non- uniformity  of  the  Sun's  field  produces  a  much  bigger  effect),  but 
because  of  angular  momentum  conservation  there  is  a  corresponding  pre- 
cession of  the  Earth's  angular  momentum  which  is  important.  We  shall 
discuss  this  effect  further  in  §10.1. 

The  second  reason  is  the  effect  of  tidal  friction.  The  dissipation  of 
energy  by  the  tides  has  the  effect  of  gradually  slowing  the  Earth's  rotation. 
We  can  picture  this  effect  as  follows:  as  the  Earth  rotates,  it  tries  to  carry 
with  it  the  tidal  'bulges',  while  the  Moon's  attraction  is  pulling  them  back 
into  line.  (See  Fig.  8.3.)  Thus  there  is  a  couple  acting  to  slow  the  Earth's 


rotation,  and  a  corresponding  equal  and  opposite  couple  tending  to  increase 
the  orbital  angular  momentum. 

To  investigate  this  effect,  we  shall  make  the  simplest  possible  approx- 
imations. We  neglect  the  rotational  angular  momentum  J*M  of  the  Moon 


J*  =  fir  A  r  +  Je  +  J^. 


(8.23) 


Fig.  8.3 
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(which  is  in  fact  very  small  compared  to  the  other  terms  in  (8.23)),  and 
assume  that  the  Earth's  axis  is  perpendicular  to  the  Moon's  orbital  plane, 
so  that  the  remaining  terms  of  (8.23)  have  the  same  direction.  The  Earth's 
rotational  angular  momentum,  as  we  shall  see  in  the  next  chapter,  has  the 
form  Jg  =  Iu>,  where  I  is  the  so-called  moment  of  inertia,  and  u>  the  rota- 
tional angular  velocity.  For  a  sphere  of  uniform  density,  mass  m  and  radius 
r,  I  has  the  value  I  =  |mr2.  In  fact,  the  Earth's  density  increases  to- 
wards its  centre,  and  I  is  therefore  somewhat  less  than  this,  approximately 
/  =  0.33mr2.  Thus  the  total  angular  momentum  about  the  centre  of  mass 
is 

J*  =  na2fl  +  0.33mr2w,  (8.24) 

where  Q  is  the  Moon's  orbital  angular  velocity.  It  is  convenient  to  express 
the  angular  velocities  in  terms  of  the  present  value  ujq  of  uj,  given  by  (5.1). 
Using  m/ n  =  82.3,  we  may  write  the  conservation  law  (8.24)  (dividing  by 
/ir2u)o)  as 


(?) 


'  '2il+  27.2—  =  160,  (8.25) 

LU0  UJq 


where  the  value  of  the  constant  on  the  right  is  obtained  by  using  the  present 
values  a/r  =  60.3  and  £1/u>q  =  0.0365.  Note  that  at  present  the  first  term 
of  (8.25)  is  almost  five  times  as  large  as  the  second. 

Now  the  orbital  angular  velocity     is  related  to  a  by  (4.32),  i.e., 

fl2a3  =  GM, 

or,  equivalently 

^  =  17.in3/2.  (8.26) 

UJq  \aS 

Inserting  this  into  (8.25)  yields  for  the  Earth's  rotational  angular  velocity 

—  =  5.88  -  0.629  (-Y/2  .  (8.27) 
ujo  \rJ 

As  ui  decreases  due  to  tidal  friction,  a  must  increase,  i.e.,  the  Moon  moves 
further  away  from  the  Earth.  Indeed  this  effect  can  now  be  measured:  the 
Moon  is  currently  receding  from  the  Earth  at  about  3.8cmyr_1.  Hence 
decreases  too.  If  we  use  (8.26)  and  (8.27)  to  plot     and  u>  as  functions  of  a 
we  obtain  curves  like  those  of  Fig.  8.4  (which  are  not  drawn  to  scale). 
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Fig.  8.4 


It  is  important  to  note  that  if  at  any  time  Q  were  actually  larger  than 
uj,  so  that  the  Moon  revolved  around  the  Earth  in  less  than  a  day,  then 
the  tidal  forces  would  act  to  increase  the  angular  velocities  and  decrease 
the  separation.  On  this  simple  picture,  the  smallest  distance  there  can 
have  been  between  the  bodies  corresponds  to  the  first  point  where  the 
curves  cross  (about  2.3  Earth  radii,  with  a  common  rotation  period  of 
under  5  hours).  However,  it  is  in  fact  impossible  that  the  Moon  should  ever 
have  been  so  close,  because  it  would  then  have  been  inside  the  Roche  limit 
(about  2.9  Earth  radii)  where  the  tidal  forces  become  so  strong  that  they 
would  break  up  the  Moon  (see  Chapter  6,  Problem  20).  It  is  now  generally 
believed  that  the  material  that  later  formed  the  Moon  was  ejected  from  a 
proto-Earth  in  a  violent  collision  with  another  planet  early  in  its  history, 
and  then  gradually  condensed  into  the  present  Moon  in  an  orbit  with  a 
radius  of  a  few  Earth  radii. 

The  final  state  of  the  system  in  the  very  distant  future  would  correspond 
to  the  second  point  where  the  curves  cross.  Then  w  and  SI  would  again  be 
equal,  and  tidal  forces  would  play  no  further  role.  In  this  final  state,  the 
Earth  would  always  present  the  same  face  to  the  Moon,  as  the  Moon  already 
does  to  the  Earth  (except  for  a  small  wobble).  The  distance  between  Earth 
and  Moon  would  then  be  about  half  as  large  again  as  it  now  is,  and  the 
common  rotation  period  (the  'day'  or  the  'month')  would  be  nearly  48  days. 

There  are,  however,  several  factors  which  might  modify  this  picture. 
It  was  at  one  time  believed  that  Mercury  and  the  Sun  had  reached  the 
final  state  described  above,  and  that  Mercury  always  presents  the  same 
face  to  the  Sun.   However,  it  is  now  known  that  the  ratio  of  the  orbital 
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and  rotation  periods  is  not  unity  but  3/2.  There  are  several  other  cases  in 
the  solar  system  of  such  rational  ratios.  The  stability  of  this  configuration 
depends  on  the  unusually  large  eccentricity  of  Mercury's  orbit.  It  is  unlikely 
that  such  a  state  could  be  reached  by  the  Earth-Moon  system. 

The  'final'  state  would  in  any  case  not  be  the  end  of  the  evolution  of  the 
system,  because  the  solar  tides  would  still  be  active.  They  would  tend  to 
slow  the  Earth's  rotation  even  more,  and  would  therefore  gradually  pull  the 
Moon  back  towards  the  Earth.  In  the  very  remote  future,  the  two  bodies 
would  collide,  although  it  is  probable  that  the  Sun  will  have  reached  the 
end  of  its  life,  and  engulfed  the  whole  system,  long  before  that  happens. 

A  significant  over-simplification  in  our  argument  is  the  assumption  that 
the  Moon's  orbit  is  a  circle  in  a  plane  normal  to  the  Earth's  axis.  In  reality, 
both  its  eccentricity  and  its  inclination  can  change  with  time,  and  of  course 
the  Earth's  axis  also  precesses  is  a  complicated  way.  The  effect  of  tidal 
dissipation  is  not  only  to  make  the  Moon's  orbit  expand,  but  also  to  make 
it  more  eccentric  and  less  inclined.  In  the  distant  past,  it  may  have  been 
considerably  more  steeply  inclined  than  it  now  is. 

The  Sun  has  another  effect  which  has  been  omitted  from  our  discus- 
sion. The  solar  ocean  tides  are  indeed  small  compared  to  the  lunar  ones. 
However,  there  are  quite  substantial  semi-diurnal  tides  in  the  Earth's  at- 
mosphere, noticeable  as  fluctuations  in  the  barometric  pressure  at  sea  level. 
They  are  predominantly  of  thermal,  rather  than  gravitational,  origin,  due 
to  heating  during  the  hours  of  daylight,  and  are  very  different  in  character 
from  the  ocean  tides.  The  most  important  component  has  a  period  of  half 
a  solar,  rather  than  lunar,  day.  It  is  a  little  surprising  that  the  dominant 
period  is  12  hours,  rather  than  24.  This  may  be  an  example  of  the  phe- 
nomenon of  resonance  (see  §2.6):  the  amplitude  of  the  12-hour  oscillation 
may  be  enhanced  by  being  closer  to  a  natural  oscillation  period  of  the  at- 
mosphere than  is  24  hours.  As  on  the  oceans,  the  Sun  exerts  a  gravitational 
couple  on  the  resulting  bulges  in  the  atmosphere.  (So  does  the  Moon  of 
course,  but  its  effect  averages  out  to  zero  over  a  lunar  month.)  It  hap- 
pens that  the  maxima  in  this  case  occur  before,  not  after,  12  noon  and  12 
midnight.  Thus  the  couple  acts  to  speed  the  Earth's  rotation  rather  than 
to  slow  it.  This  process  has  the  effect  of  transferring  angular  momentum 
from  the  Earth's  orbital  motion  to  its  rotation.  It  therefore  tends  to  de- 
crease the  Earth's  orbital  radius,  pulling  it  (immeasurably  slowly)  closer  to 
the  Sun.  A  purely  gravitational  process  could  not  have  this  effect,  for  the 
energy  (kinetic  plus  potential)  is  actually  increasing.  This  energy  comes 
from  solar  heating,  with  the  Earth  acting  as  a  kind  of  giant  heat  engine 
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converting  heat  to  mechanical  work.  However,  the  net  effect  of  the  atmo- 
spheric tides  in  speeding  the  Earth's  rotation  is  not  nearly  large  enough  to 
counterbalance  that  of  the  ocean  tides  in  slowing  it. 

8.4    Energy;  Conservative  Forces 

The  kinetic  energy  of  our  system  of  particles  is 

T  =  J2  \m^-  (8-28) 

i 

As  in  the  case  of  angular  momentum,  T  can  be  separated  into  centre-of- 
mass  and  relative  terms.  Substituting  (8.16)  into  (8.28),  we  again  find  that 
the  cross  terms  drop  out  because  of  (8.17).  Thus  we  obtain 

T=\MR2+T*,  (8.29) 

where  T* ,  the  kinetic  energy  relative  to  the  centre  of  mass,  is 

T*  =  J2  \rmrf-  (8-30) 

i 

The  relations  (8.3),  (8.18)  and  (8.29)  show  that,  exactly  as  in  the  two- 
particle  case  (see  (7.16)),  the  momentum,  angular  momentum  and  kinetic 
energy  in  any  frame  are  obtained  from  those  in  the  CM  frame  by  adding 
the  contribution  of  a  particle  of  mass  M  located  at  the  centre  of  mass  R. 
The  rate  of  change  of  kinetic  energy  is 

t  =   »^  •  u  =  53  X)  * « ■  Fv + Z)  *  <  ■ Fi-  (8-31) 

i  i       j  i 

This  equation  may  be  compared  with  (8.12).  Let  us  again  examine  the 
contribution  from  the  internal  force  F\i  =  —F21  =  F  between  particles  1 
and  2  (see  Fig.  8.2).  As  in  (8.13),  it  is 

f  1  •  F  —  T"2  •  F  =  r  ■  F. 

This  is  of  course  the  rate  at  which  the  force  F  does  work.  However,  in 
contrast  to  the  previous  situation,  there  is  in  general  no  reason  for  this  to 
vanish,  and  T  cannot  therefore  be  written  in  terms  of  the  external  forces 
alone.  This  is  not  at  all  surprising,  because  in  general  we  must  expect  there 
to  be  some  change  in  the  internal  potential  energy  of  the  system. 

We  may  note  that  there  are  certain  special  cases  in  which  a  force  does 
no  work,  and  may  be  omitted  from  the  energy  change  equation.  This  will 
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be  the  case,  for  example,  if  r  =  0  (e.g.,  the  reaction  at  a  fixed  pivot),  or 
if  F  is  always  perpendicular  to  r  (e.g.,  the  magnetic  force  on  a  charged 
particle).  An  important  special  case  is  that  of  a  rigid  body,  in  which  the 
distances  between  all  the  particles  are  fixed.  Then  r  is  constant,  so  that 
r  ■  r  =  0.  If  the  force  is  central,  then  it  has  the  same  direction  as  r,  and 
is  always  perpendicular  to  r.  Thus  a  central  force  between  particles  whose 
distance  apart  is  fixed  does  no  work. 

Now,  however,  let  us  make  the  less  restrictive  assumption  that  all  the 
internal  forces  are  conservative.  Then  the  force  F  must  correspond  to  a 
potential  energy  function  of  particles  1  and  2,  and  the  rate  of  working  r  ■  F 
will  be  equal  to  minus  the  rate  of  change  of  this  potential  energy.  Let  us 
denote  the  internal  potential  energy  —  the  sum  of  the  potential  energies 
for  all  the  ±N(N  -  1)  pairs  of  particles  —  by  Vjnt  •  Then  clearly  the  rate  of 
working  of  all  the  internal  forces  will  be  minus  the  rate  of  change  of  Vint. 
Thus  we  obtain 


The  rate  of  change  of  the  kinetic  plus  internal  potential  energy  is  equal  to 
the  rate  at  which  the  external  forces  do  work. 

We  can  also  find  the  rate  of  change  of  energy  relative  to  the  centre  of 
mass.  From  (8.6),  we  have 


Hence,  subtracting  from  (8.32)  and  using  (8.29)  and  (8.16),  we  find 


This  equation  is  the  analogue  of  (8.21).  Note  that  Vint  is  a  function  only  of 
the  differences  r  j  —  r  j ,  and  is  therefore  unaffected  by  the  choice  of  reference 
frame.  Thus  T*  +  Vint  is  the  total  energy  in  the  CM  frame. 

Now  let  us  suppose  that  the  external  forces  are  also  conservative.  Then 
there  must  exist  a  corresponding  external  potential  energy  function  Vext, 
whose  rate  of  change  is  minus  the  rate  of  working  of  the  external  forces. 
From  (8.32)  we  then  find  the  law  of  conservation  of  energy 


(8.32) 


(8.33) 


T  +  V  =  E  =  constant,        where       V  =  Vext  +  Vint .  (8.34) 
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It  is  important  to  notice,  however,  that  Vext  depends  on  the  co-ordinates 
themselves,  not  merely  the  co-ordinate  differences,  and  is  therefore  depen- 
dent on  the  choice  of  frame,  unlike  V-lrA .  Thus  in  general  it  is  not  possible  to 
write  down  a  similar  equation  in  which  only  the  kinetic  energy  T*  relative 
to  the  centre  of  mass  appears. 


8.5    Lagrange's  Equations 

We  close  this  chapter  with  a  brief  discussion  of  the  extension  of  the  La- 
grangian  method  of  §3.7  to  a  many-particle  system.  We  consider  only  the 
case  where  all  the  forces  are  conservative,  so  that  there  exists  a  total  po- 
tential energy  function  V.  We  saw  in  §7.1  (see  the  discussion  following 
(7.9))  that  the  derivatives  of  the  potential  energy  function  Vint(^i  —  ^2) 
corresponding  to  a  two-body  force  with  respect  to  the  components  of  T\  or 
V2  give  correctly  minus  the  forces  on  the  corresponding  particles.  Thus  the 
components  of  the  total  force  on  the  ith  particle  will  be  minus  the  deriva- 
tives of  V  with  respect  to  the  components  of  r,.  The  equations  of  motion 
may  thus  be  written 

dV 

mtxt  =  -  —  ,       (i  =  l,2,...,N),  (8.35) 

together  with  similar  equations  for  the  y  and  2  components. 
Now,  clearly, 

dT  _ 

■— —  —  TTliXi, 

oxi 

while  dT/dxi  and  dV/dii  both  vanish.  Thus  (8.35)  are  identical  with 
Lagrange's  equations 

with  L  =  T  —  V. 

Exactly  as  in  §3.7,  it  follows  that  the  action  integral, 

/=  f1  Ldt, 

is  stationary  under  arbitrary  small  variations  of  the  3N  co-ordinates  n  = 
(xi,yi,Zi)  which  vanish  at  the  limits  of  integration.  Then,  again  as  in 
§3.7,  we  may  re-express  L  in  terms  of  any  other  set  of  3./V  co-ordinates, 
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qi,  Q2,  ■  ■  ■ ,  Qsn-  The  corresponding  equations  of  motion  will  be  Lagrange's 
equations 

As  an  example,  let  us  suppose  that  external  forces  are  provided  by  a 
uniform  gravitational  field  g. 

Example:  Uniform  gravitational  field 

Show  that  the  centre-of-mass  and  internal  motions  can  be  com- 
pletely separated  in  the  case  of  a  uniform  gravitational  field,  with 
two  separate  energy  conservation  laws. 

The  potential  energy  function  for  the  uniform  field  is 

Kxt  =        mg  ■  Ti  =  -Mg  ■  R. 

i 

Now,  let  us  write  the  Lagrangian  in  terms  of  the  three  co-ordinates  R 
and  the  co-ordinates  r*  (of  which  only  37V  —  3  are  independent,  because 
of  (8.17)).  Then,  using  (8.29),  we  find 

L  =  \MFL  +Mg-R  +  J2  mrf  ~  Vint-  (8.38) 

i 

Since  Vint  is  a  function  only  of  the  co-ordinate  differences  r,  —  rj  = 
r  *  —  r  * ,  this  Lagrangian  function  exhibits  a  complete  separation  between 
the  terms  involving  R  and  those  involving  the  r*,  just  as  in  the  two- 
particle  case  of  §7.1.  For  the  case  of  a  uniform  gravitational  field,  the 
motion  of  the  centre  of  mass  and  the  relative  motion  are  uncoupled.  In 
particular,  there  are  two  separate  conservation  laws  for  energy, 

\MR  -  Mg  ■  R  =  constant, 

and 

T*  +  Vint  =  constant. 

As  we  saw  in  §8.2,  there  is  also  in  this  case  a  separate  conservation  law 
for  angular  momentum  relative  to  the  centre  of  mass. 

There  is  an  important  physical  consequence  of  this  separation:  no  ob- 
servation of  the  internal  motion  of  a  system  can  ever  reveal  the  presence  of 
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a  uniform  gravitational  field,  since  such  a  field  in  no  way  affects  the  equa- 
tions of  motion  for  the  relative  co-ordinates  r*.  Note  that  a  laboratory  is 
subjected  to  other  external  forces  besides  the  Earth's  gravitational  field, 
since  it  is  supported  by  the  ground.  Indeed  if  the  Earth  were  removed,  but 
the  supporting  forces  somehow  retained,  there  would  be  no  observable  dif- 
ference inside  the  laboratory,  which  would  of  course  be  accelerated  upwards 
with  acceleration  g\  This  was  an  important  consideration  in  the  argument 
which  led  to  Einstein's  general  theory  of  relativity. 

8.6  Summary 

The  centre  of  mass  of  any  system  moves  like  a  particle  of  mass  M  acted 
on  by  a  force  equal  to  the  total  force  on  the  system.  The  contribution  of 
this  motion  to  the  angular  momentum  or  kinetic  energy  may  be  completely 
separated  from  the  contributions  of  the  relative  motion,  and  J  or  T  may 
be  written  as  a  sum  of  two  corresponding  terms.  (Of  course,  the  only 
contribution  to  P  comes  from  the  centre-of-mass  motion.) 

When  the  internal  forces  are  central,  the  rate  of  change  of  angular  mo- 
mentum is  equal  to  the  sum  of  the  moments  of  the  external  forces.  When 
they  are  conservative,  the  rate  of  change  of  the  kinetic  energy  plus  the  in- 
ternal potential  energy  is  equal  to  the  rate  of  working  of  the  external  forces. 
In  both  cases,  the  same  thing  is  true  for  the  motion  relative  to  the  centre 
of  mass. 

If  the  external  forces  are  also  central,  or  conservative,  then  the  total 
angular  momentum,  or  total  energy  (including  external  potential  energy), 
respectively,  are  conserved.  In  particular,  for  an  isolated  system,  P,  J  and 
T  +  Vint  are  all  constants. 


Problems 

1.  A  rocket  is  launched  from  the  surface  of  the  Earth,  to  reach  a  height  of 
50  km.  Find  the  required  velocity  impulse,  neglecting  the  variation  of 
g  with  height  (and  the  Earth's  rotation).  Given  that  the  mass  of  the 
payload  and  rocket  without  fuel  is  100  kg,  and  the  ejection  velocity  is 
2kms_1,  find  the  required  initial  mass. 

2.  A  satellite  is  orbiting  the  Earth  in  a  circular  orbit  230  km  above  the 
equator.  Calculate  the  total  velocity  impulse  needed  to  place  it  in  a 


Many-Body  Systems 


193 


synchronous  orbit  (see  Chapter  4,  Problem  1),  using  an  intermediate 
semi-elliptical  transfer  orbit  which  just  touches  both  circles.  (Hint: 
From  the  orbit  parameters  a  and  ae,  find  I,  and  hence  the  velocities.) 
Given  that  the  final  mass  to  be  placed  in  orbit  is  30  kg,  and  the  ejection 
velocity  of  the  rocket  is  2.5  kms-1,  find  the  necessary  initial  mass. 
3.  Assume  that  the  residual  mass  of  a  rocket,  without  pay  load  or  fuel,  is 
a  given  fraction  A  of  the  initial  mass  including  fuel  (but  still  without 
payload).  Show  that  the  total  take-off  mass  required  to  accelerate  a 
payload  m  to  velocity  v  is 

1  "  A 

Ma  = 


-v/u  _  \  ' 


If  A  =  0.15,  what  is  the  upper  limit  to  the  velocity  attainable  with  an 
ejection  velocity  of  2.5 kms-1? 

4.  Find  a  formula  analogous  to  that  of  Problem  3  for  a  two-stage  rocket, 
in  which  each  stage  produces  the  same  velocity  impulse.  (The  first- 
stage  rocket  is  discarded  when  its  fuel  is  burnt  out.)  With  the  figures 
of  Problem  3,  what  is  the  minimum  number  of  stages  required  to  reach 
escape  velocity  (11.2 kms-1)?  With  that  number,  what  take-off  mass 
is  required,  if  the  payload  mass  is  100  kg? 

5.  Find  the  velocity  impulse  needed  to  launch  the  spacecraft  on  its  trip 
to  Jupiter,  described  in  Chapter  4,  Problem  16.  (Hint:  Use  energy 
conservation  to  find  the  velocity  at  the  surface  of  the  Earth  needed  to 
give  the  appropriate  relative  velocity  for  the  spacecraft  once  it  has  es- 
caped.) If  a  three-stage  rocket  is  used,  and  the  parameter  A  of  Problem 
3  is  0.1,  what  is  the  minimum  required  ejection  velocity?  Given  that 
u  =  2.5kms-1,  and  that  the  mass  of  the  payload  is  500kg,  find  the 
total  take-off  mass. 

6.  Find  the  gain  in  kinetic  energy  when  a  rocket  emits  a  small  amount  of 
matter.  Hence  calculate  the  total  energy  which  must  be  supplied  from 
chemical  or  other  sources  to  accelerate  the  rocket  to  a  given  velocity. 
Show  that  this  is  equal  to  the  energy  required  if  an  equal  amount  of 
matter  is  ejected  while  the  rocket  is  held  fixed  on  a  test-bed. 

7.  *A  rocket  with  take-off  mass  Mq  is  launched  vertically  upward,  as  in 
Problem  1.  Consider  the  effect  of  a  finite  burn- up  time.  Show  that,  if 
the  rocket  ejects  matter  at  a  constant  rate  a,  then  its  height  at  time  t 


uM ,   M0     1    2  .  , 

ut  In  -rr  -  \gtA.       with       M  =  M0  —  at. 

a       M  2 
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Hence,  show  that  if  it  burns  out  after  a  time  t\,  leaving  a  final  mass 
Mi,  then  [provided  that  t\  <  (u/g)  ln(Mo/Mi)],  the  maximum  height 
reached  is 

2g  V    Mj  1\M0-M1     Mi  ; 

Using  the  same  values  of  u,  Mq  and  Mi  as  in  Problem  1,  find  the 
maximum  height  reached  if  the  burn-up  time  is  (a)  10  s,  and  (b)  30  s. 

8.  *A  satellite  is  to  be  launched  into  a  synchronous  orbit  directly  from  the 
surface  of  the  Earth,  using  a  rocket  launched  vertically  from  a  point 
on  the  equator.  Find  the  required  launch  speed  to  achieve  the  desired 
apogee,  and  the  required  velocity  increment.  With  the  same  figures  as 
in  Problem  2,  find  the  required  take-off  mass. 

9.  Two  billiard  balls  are  resting  on  a  smooth  table,  and  just  touching.  A 
third  identical  ball  moving  along  the  table  with  velocity  v  perpendicu- 
lar to  their  line  of  centres  strikes  both  balls  simultaneously.  Find  the 
velocities  of  the  three  balls  immediately  after  impact,  assuming  that 
the  collision  is  elastic. 

10.  A  spherical  satellite  of  radius  r  is  moving  with  velocity  v  through  a 
uniform  tenuous  atmosphere  of  density  p.  Find  the  retarding  force  on 
the  satellite  if  each  particle  which  strikes  it  (a)  adheres  to  the  surface, 
and  (b)  bounces  off  it  elastically.  Can  you  explain  why  the  two  answers 
are  equal,  in  terms  of  the  scattering  cross-section  of  a  hard  sphere? 

11.  *If  the  orbit  of  the  satellite  of  Problem  10  is  highly  elliptical,  the  re- 
tarding force  is  concentrated  almost  entirely  in  the  lowest  part  of  the 
orbit.  Replace  it  by  an  impulsive  force  of  impulse  /  delivered  once 
every  orbit,  at  perigee.  By  considering  changes  in  energy  and  angular 
momentum,  find  the  changes  in  the  parameters  a  and  I.  Show  that 
51  =  5a(l  —  e)2,  and  hence  that  the  effect  is  to  decrease  the  period  and 
apogee  distance,  while  leaving  the  perigee  distance  unaffected.  (The 
orbit  therefore  becomes  more  and  more  circular  with  time.)  Show  that 
the  velocity  at  apogee  increases,  while  that  at  perigee  decreases. 

12.  ^Suppose  that  the  satellite  of  Problems  10  and  11  has  achieved  a  cir- 
cular orbit  of  radius  a.  Find  the  rates  of  change  of  energy  and  angular 
momentum,  and  hence  show  that  the  rates  of  change  of  a  and  I  are 
equal,  so  that  the  orbit  remains  approximately  circular.  Show  also 
that  the  velocity  of  the  satellite  must  increase. 

13.  If  the  satellite  orbit  of  Problem  12  is  500  km  above  the  Earth's  sur- 
face, the  mass  and  radius  of  the  satellite  are  30  kg  and  0.7  m,  and 
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p  =  10_13kgm-3,  find  the  changes  in  orbital  period  and  height  in  a 
year.  If  the  height  is  200  km  and  p  =  10_10kgm-3,  find  the  changes  in 
a  single  orbit. 

14.  Find  the  lengths  of  the  'day'  and  the  'month'  (a)  when  the  Moon  was 
10  Earth  radii  away,  and  (b)  when  the  solar  and  lunar  tides  become 
equal  in  magnitude.  (See  Chapter  6,  Problem  16.) 

15.  Show  that  in  a  conservative  iV-body  system,  a  state  of  minimal  total 
energy  for  a  given  total  z-component  of  angular  momentum  is  necessar- 
ily one  in  which  the  system  is  rotating  as  a  rigid  body  about  the  2-axis. 
[Use  the  method  of  Lagrange  multipliers  (see  Appendix  A,  Problem 
11),  and  treat  the  components  of  the  positions  and  velocities  fj  as 
independent  variables.] 

16.  *A  planet  of  mass  M  is  surrounded  by  a  cloud  of  small  particles  in  orbits 
around  it.  Their  mutual  gravitational  attraction  is  negligible.  Due  to 
collisions  between  the  particles,  the  energy  will  gradually  decrease  from 
its  initial  value,  but  the  angular  momentum  will  remain  fixed,  J  =  Jo, 
say.  The  system  will  thus  evolve  towards  a  state  of  minimum  energy, 
subject  to  this  constraint.  Show  that  the  particles  will  tend  to  form 
a  ring  around  the  planet.  [As  in  Problem  15,  the  constraint  may  be 
imposed  by  the  method  of  Lagrange  multipliers.  In  this  case,  because 
there  are  three  components  of  the  constraint  equation,  we  need  three 
Lagrange  multipliers,  say  ux ,  uiy ,  uiz .  We  have  to  minimize  the  function 
E  —  u>  ■  (J  —  Jo)  with  respect  to  variations  of  the  positions  r«  and 
velocities  fj,  and  with  respect  to  u>.  Show  by  minimizing  with  respect 
to  r i  that  once  equilibrium  has  been  reached  the  cloud  rotates  as  a  rigid 
body,  and  by  minimizing  with  respect  to  rt  that  all  particles  occupy  the 
same  orbit.]  What  happens  to  the  energy  lost?  Why  does  the  argument 
not  necessarily  apply  to  a  cloud  of  particles  around  a  hot  star? 

17.  *An  ./V-body  system  is  interacting  only  through  the  gravitational  forces 
between  the  bodies.  Show  that  the  potential  energy  function  V  satisfies 
the  equation 

J2ri-ViV  =  -V, 

i 

where  V«  =  (d/dxi,d/dyi,d/dzi).  (Hint:  Show  that  each  two-body 
term  Vij  satisfies  this  equation.  This  condition  expresses  the  fact  that 
V  is  a  homogeneous  function  of  the  co-ordinates  of  degree  —1.) 
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18.  *Under  the  conditions  of  Problem  17,  show  that  the  total  kinetic  and 
potential  energies  T  and  V  satisfy  the  virial  equation, 

A2K 

2T+V=—,        where       K  =  |^m*r*  • 

i 

(Note  that  K  relates  to  the  overall  scale  of  the  system.  We  may  define 
a  root-mean-square  radius  r  by  K  =  ^Mr2.)  Deduce  that,  if  the  scale 
of  the  system  as  measured  by  K  is,  on  average,  neither  growing  nor 
shrinking,  then  the  time-averaged  value  of  the  total  energy  is  equal 
to  minus  the  time-averaged  value  of  the  kinetic  energy  —  the  virial 
theorem.  (Compare  Chapter  4,  Problem  19.) 


Chapter  9 

Rigid  Bodies 


The  principal  characteristic  of  a  solid  body  is  its  rigidity.  Under  normal 
circumstances,  its  size  and  shape  vary  only  slightly  under  stress,  changes  in 
temperature,  and  the  like.  Thus  it  is  natural  to  consider  the  idealization  of 
a  perfectly  rigid  body,  whose  size  and  shape  are  permanently  fixed.  Such  a 
body  may  be  characterized  by  the  requirement  that  the  distance  between 
any  two  points  of  the  body  remains  fixed.  In  this  chapter,  we  shall  be 
concerned  with  the  mechanics  of  rigid  bodies. 

9.1    Basic  Principles 

It  will  be  convenient  to  simplify  the  notation  of  the  previous  chapter  by 
omitting  the  particle  label  i  from  sums  over  all  particles  in  the  rigid  body. 
Thus,  for  example,  we  shall  write 

P  =  ^mr,        and       J  =  ^mrAf, 

in  place  of  (8.3)  and  (8.11). 

The  motion  of  the  centre  of  mass  of  the  body  is  completely  specified  by 
(8.6): 

P  =  MR=J2F-  (9-1) 

Our  main  interest  in  this  chapter  will  be  centred  on  the  rotational  motion 
of  the  rigid  body.  For  the  moment,  let  us  assume,  as  we  originally  did  in 
§8.3,  that  the  internal  forces  are  central.  Then,  according  to  (8.14), 

j  =  ^rAF.  (9.2) 
197 
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We  shall  see  later  that  these  two  equations  are  sufficient  to  determine  the 
motion  completely. 

One  very  important  application  of  (9.1)  and  (9.2)  should  be  noted.  For 
a  rigid  body  at  rest,  r  =  0  for  every  particle,  and  thus  both  P  and  J 
vanish.  Clearly,  the  body  can  remain  at  rest  only  if  the  right  hand  sides 
of  both  (9.1)  and  (9.2)  vanish,  i.e.,  if  the  sum  of  the  forces  and  the  sum 
of  their  moments  are  both  zero.  In  fact,  as  we  shall  see,  this  is  not  only  a 
necessary,  but  also  a  sufficient,  condition  for  equilibrium. 

Under  the  same  assumption  of  central  internal  forces,  we  saw  in  §8.5 
that  the  internal  forces  do  no  work,  so  that 


This  might  at  first  sight  appear  to  be  a  third  independent  equation.  How- 
ever, we  shall  see  later  that  it  is  actually  a  consequence  of  the  other  two. 
It  is  of  course  particularly  useful  in  the  case  when  the  external  forces  are 
conservative,  since  it  then  leads  to  the  conservation  law 


where  V  is  the  external  potential  energy,  previously  denoted  by  V^xt-  (Note 
that  there  is  no  mention  here  of  internal  potential  energy;  in  a  rigid  body, 
this  does  not  change.) 

The  assumption  that  the  internal  forces  are  central  is  much  stronger 
than  it  need  be.  Indeed,  as  we  discussed  in  §8.3,  it  could  not  be  justified 
from  our  knowledge  of  the  internal  forces  in  real  solids,  which  are  certainly 
not  exclusively  central,  and  in  any  case  cannot  be  adequately  described 
by  classical  mechanics.  All  we  actually  require  is  the  validity  of  the  basic 
equations  (9.1)  and  (9.2),  and  it  is  better  to  regard  these  as  basic  assump- 
tions of  rigid  body  dynamics,  whose  justification  lies  in  the  fact  that  their 
consequences  agree  with  observation.  To  some  extent,  the  success  of  these 
postulates,  particularly  (9.2),  may  seem  rather  fortuitous.  However,  we 
shall  see  in  Chapter  12  that  it  is  closely  related  to  the  relativity  principle 
discussed  in  Chapter  1. 

9.2    Rotation  about  an  Axis 

Let  us  now  apply  these  basic  equations  to  a  rigid  body  which  is  free  to 
rotate  only  about  a  fixed  axis,  which  for  simplicity  we  take  to  be  the  z-axis. 


(9.3) 


T  +  V  =  E  =  constant, 


(9.4) 
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We  also  choose  the  origin  on  this  axis,  positioned  so  that  the  z  co-ordinate 
of  the  centre  of  mass  is  zero. 

In  cylindrical  polars,  the  z  and  p  co-ordinates  of  every  point  are  fixed, 
while  the  ip  co-ordinate  varies  according  to  ip  =  ui,  the  angular  velocity  of 
the  body  (not  necessarily  constant).  We  examine  first  the  component  of 
angular  momentum  about  the  axis  of  rotation.  By  (3.29),  it  is 


is  the  moment  of  inertia  about  the  axis.  Since  /  is  obviously  constant,  the 
2  component  of  (9.2)  yields 


This  equation  determines  the  rate  of  change  of  angular  velocity,  and  may 
be  called  the  equation  of  motion  of  the  rotating  body. 

The  condition  for  equilibrium  here  is  simply  that  the  right  hand  side  of 
(9.7)  vanishes,  i.e.,  that  the  forces  have  no  net  moment  about  the  2-axis. 
For  example,  consider  a  rectangular  lamina  (i.e.,  a  plane,  two-dimensional 
object)  of  size  a  x  b  and  negligible  mass,  pivoted  at  one  corner,  carrying  a 
weight  Mg  and  supported  by  a  horizontal  force  F,  as  shown  in  Fig.  9.1.  The 
total  moment  here  is  bF—aMg,  so  for  equilibrium  we  require  F  =  (a/b)Mg. 


(9.5) 


where 


(9.6) 


(9.7) 


F 


b 


Mg 


Fig.  9.1 


The  kinetic  energy  may  also  be  expressed  in  terms  of  I.  For 


(9.8) 
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The  equation  (9.3)  is,  therefore, 

T  =  Iojuj  =  ^(p<p)Fv  =  u^pFv>. 

This  is  clearly  a  consequence  of  (9.7),  and  gives  us  no  additional  informa- 
tion. 

The  momentum  equation  (9.1)  serves  to  determine  the  reaction  at  the 
axis,  which  has  of  course  no  moment  about  the  axis,  and  so  does  not  appear 
in  (9.7).  Let  us  denote  the  force  on  the  body  at  the  axis  by  Q,  and  separate 
this  from  all  the  other  forces  on  the  body.  Then  from  (9.1) 

P  =  MR=Q  +  Y,F  (9-9) 

For  example,  in  the  case  illustrated  in  Fig.  9.1,  the  equilibrium  condition 
P  =  0  immediately  yields  Q  =  (F,Mg,0).  (Note  that  the  z-axis  here  is 
horizontal.) 

The  centre  of  mass  is  of  course  fixed  in  the  body,  so  that,  by  (5.2), 
R  =  uj  A  R.  Differentiating  again  to  find  the  acceleration  of  the  centre  of 
mass,  we  obtain 

R  =  6jAR  +  ujAR  =  ujAR  +  uA(uAR).  (9.10) 

The  first  term  is  the  tangential  acceleration,  Rio  in  the  <p  direction,  and  the 
second  the  radial  acceleration,  —lo2R  in  the  p  direction.  (Compare  (3.47).) 
Together,  equations  (9.9)  and  (9.10)  serve  to  determine  Q. 

As  a  simple  example,  let  us  consider  a  compound  pendulum  —  a  rigid 
body  pivoted  about  a  horizontal  axis,  and  moving  under  gravity. 

Example:  Compound  pendulum 

Find  the  period  of  small  oscillations  of  a  compound  pendulum, 
and  the  reaction  at  the  pivot  as  a  function  of  angle. 

To  be  consistent,  we  shall  still  take  the  z-axis  to  be  the  axis  of 
rotation,  and  choose  the  :r-axis  vertically  downward.  (See  Fig.  9.2.) 
Then  the  force  (the  weight)  acts  at  the  centre  of  mass,  and  has  the 
components  F  =  (Mg, 0,0).  Thus  the  equation  of  motion  (9.7)  is 


I (p  =  —MgR  sin  (p. 


(9.11) 


Rigid  Bodies 


201 


Fig.  9.2 

This  is  identical  with  the  equation  of  motion  of  a  simple  pendulum  of 
length  I  =  I /MR.  (This  is  the  length  of  the  equivalent  simple  pendulum) 
Thus  the  period  of  small  oscillations  is  2-K^Jl/ g  =  2tt  I /MgR.  As  we 
shall  see,  I  is  always  larger  than  the  distance  R  from  the  pivot  to  the 
centre  of  mass. 

The  energy  conservation  equation  is 

T  +  V  =  \lip2  -  MgRcos  <p  =  E  =  constant.  (9.12) 

It  may  be  obtained  from  (9.11)  by  multiplying  by  ip  and  integrating.  As 
usual,  E  is  determined  by  the  initial  conditions,  and  (9.12)  then  serves 
to  fix  the  angular  velocity  ip  for  any  given  inclination  ip. 

The  components  of  the  reaction  Q,  obtained  from  (9.9),  are 

Qz  =0 

Qp  =  -Mg  cos  tp  -  MRip2, 
Qv  =  M  g  sin  ip  +  MRip. 

Thus,  substituting  for  (p  in  terms  of  p  using  (9.11),  and  for  ip  in  terms 
of  ip  by  (9.12),  we  find 

Qz  =0 

Qp  =  -Mg(l  +  2R/1)  cosp  -  2E/1, 
Qv  =  Mg(l  -  R/l)sinip. 
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Note,  however,  that  this  result  is  not  the  same  as  the  reaction  at  the 
pivot  of  a  simple  pendulum  of  length  I. 

As  a  rather  different  illustration  of  the  use  of  these  equations,  let  us 
consider  the  effect  of  a  sudden  blow. 

Example:  Effect  of  sudden  blow 

Suppose  that  the  pendulum  is  initially  in  equilibrium  and  is  given 
a  sharp  horizontal  blow  of  impulse  if  at  a  distance  d  below 
the  axis  (see  Fig.  9.3).  For  what  value  of  d  does  the  impulsive 
reaction  at  the  pivot  vanish? 


Fig.  9.3 


From  (9.11)  integrated  over  a  short  time  interval,  we  learn  that  the 
angular  velocity  u  just  after  the  blow  is  given  by  Iu>  =  dK.  As  before, 
(9.9)  serves  to  fix  the  reaction  at  the  axis,  in  this  case  the  impulsive 
reaction,  S  say.  Since  the  velocity  of  the  centre  of  mass  immediately 
after  the  blow  is  loR,  the  integrated  form  of  (9.9)  is 

MujR  =-S  +  K, 

whence 

MdR\ 


K. 


I  ) 

The  interesting  thing  about  this  result  is  that  it  is  possible  to  make 
S  vanish  by  choosing  d  equal  to  the  length  of  the  equivalent  simple 
pendulum,  d  =  I/MR  =  I.  This  is  the  ideal  point  at  which  to  hit  a  ball 
with  a  bat  (sometimes  called  the  'sweet  spot'  in  sporting  applications  — 
cricket,  tennis,  baseball,  etc.). 
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9.3    Perpendicular  Components  of  Angular  Momentum 

We  now  consider  the  remaining  components  of  the  angular  momentum  vec- 
tor. We  shall  see  that  they  give  us  further  information  about  the  reaction 
on  the  axis. 

In  Cartesian  co-ordinates,  the  velocity  of  a  point  r  of  the  body  is  given 

by 

x  =  —toy,       y  =  u>x,       z  =  0.  (9.13) 

Thus  we  find 

Jx=Yj  m(-zy)  =  -  ]T  mxzuj, 
Jy  =J2  m(zx)    =  —  myzuj. 

We  can  write  all  three  components  of  J  in  the  form 

Jx  =  IxzV,  Jy  =  IyzU,  Jz  =  hzU,  (9-14) 

where 

hz  =  -  'Ylmxz,    Iyz  =  -^2'rnyz,    Izz  =  ^  m(x2  +  y2).  (9.15) 

Here  Izz  is  the  moment  of  inertia  about  the  z-axis,  previously  denoted  by 
/.  The  quantities  Ixz  and  Iyz  are  called  products  of  inertia. 

At  first  sight  it  may  seem  surprising  that  J  has  components  in  directions 
perpendicular  to  w.  A  simple  example  may  help  to  clarify  the  reason  for 
this.  Consider  a  light  rigid  rod  with  equal  masses  m  at  its  two  ends,  rigidly 
fixed  at  an  angle  6  to  an  axis  through  its  mid-point.  (See  Fig.  9.4.)  If  the 
positions  of  the  masses  are  r  and  —r,  the  total  angular  momentum  is 

J  =  rnr  A  r  +  m(—r)  A  (— r)  =  2rnr  A  (uj  A  r). 

Clearly,  J  is  perpendicular  to  r,  as  shown  in  the  figure.  When  the  rod  is 
in  the  :rz-plane,  the  masses  are  moving  in  the  ±y  directions,  and  there  is 
a  component  of  angular  momentum  about  the  x-axis  as  well  as  one  about 
the  z-axis. 

In  this  example,  the  centre  of  mass  lies  on  the  axis.  Thus  if  no  external 
force  is  applied,  the  total  reaction  on  the  axis  is  zero.  There  is,  however, 
a  resultant  couple  on  the  axis  (here  represented  by  the  forces  F'),  which 
is  required  to  balance  the  couple  produced  by  the  centrifugal  forces.  The 
magnitude  of  this  couple  may  be  determined  from  the  remaining  pair  of  the 
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Fig.  9.4 

equations  (9.2).  From  (9.15),  we  see  that,  unlike  Izz  the  products  ol  inertia 
are  not  constants,  in  general.  Using  (9.13),  we  find 

Ixz  =  ~LdIyz:  lyz  =  ^^izi  Izz  =  0- 

Thus  il  G  is  the  couple  on  the  axis,  we  have  from  (9.2) 

jx  =  IxzCo  -  Iyzuj2  =  GX  +  ^2(yFz  -  zFy), 

jy  =  lyzU  +  Ixz^2  =  Gy+  ^{zFx  ~xFz). 

If  there  are  no  other  external  forces,  then  a;  is  a  constant,  and  G  precisely 
balances  the  centrifugal  couple.  When  the  rod  is  in  the  zz-plane,  the  only 
non-vanishing  component  of  G  is  the  moment  about  the  y-axis, 

Gy  =  dF'x  =  Ixzlj2  =  —  2mr2uj2  sin  8  cos  9. 


This  determines  F '. 
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9.4    Principal  Axes  of  Inertia 


We  have  seen  that,  in  general,  the  angular  momentum  vector  J"  is  in  a 
different  direction  from  the  angular  velocity  vector  ui.  There  are  special 
cases,  however,  in  which  the  products  of  inertia  Ixz  and  Iyz  vanish.  Then 
J  is  also  in  the  z  direction.  In  that  case,  the  z-axis  is  called  a  principal  axis 
of  inertia.  When  a  body  is  rotating  freely  about  a  principal  axis  through 
its  centre  of  mass,  there  is  no  resultant  force  or  couple  on  the  axis. 

The  z-axis  is  a  principal  axis,  in  particular,  if  the  icy-plane  is  a  plane  of 
reflection  symmetry;  for  then  the  contributions  to  the  products  of  inertia  Ixz 
and  Iyz  from  any  point  (x,  y,  z)  is  exactly  cancelled  by  that  from  (a;,  y,  —z). 
Similarly,  it  is  a  principal  axis  if  it  is  an  axis  of  rotational  symmetry,  for 
then  the  contribution  from  (x,  y,  z)  is  cancelled  by  that  from  (— x,  —y,  z). 

For  bodies  with  three  symmetry  axes  —  such  as  a  rectangular  paral- 
lelepiped or  an  ellipsoid  —  there  are  obviously  three  perpendicular  principal 
axes,  and  we  shall  see  later  that  this  is  true  more  generally.  In  this  case,  it 
is  clearly  an  advantage  to  choose  these  as  our  co-ordinate  axes.  So  we  shall 
no  longer  assume  that  the  axis  of  rotation  is  the  z-axis,  but  take  it  to  have 
an  arbitrary  inclination.  Then  oj  has  three  components  (ujx,  ujy,ujz). 

The  angular  momentum  J  may  be  written,  using  (A.  16),  in  the  form 


J  =       mr  A(uAr)  =       m[r2u>  —  (r  ■  u>)r]. 


(9.16) 


Its  components  are  therefore  linear  functions  of  the  components  of  w,  which 
we  may  write  in  matrix  notation  as 


J X 

* XX 

Ixy 

Jy 

lyx 

lyy 

Jz. 

Jzx 

Izy 

~UX~ 

U)y 

Uz_ 

(9.17) 


that  is,  Jx  =  Ixxlox  +  IxyLOy  +  Ixzuz,  and  two  similar  equations.  The 
nine  elements  Ixx,  Ixyi . . . ,  Izz  of  the  3x3  matrix  may  be  regarded  as  the 
components  of  a  single  entity  I,  in  much  the  same  way  as  the  three  quantities 
JX,JV,  Jz  are  regarded  as  the  components  of  the  vector  J.  The  entity  I  is 
called  a  tensor,  in  this  case  the  inertia  tensor.  The  elementary  properties 
of  tensors  are  described  in  §A.9.  However,  we  shall  need  to  use  only  one 
result  from  this  general  theory. 

It  is  obvious  from  the  definition  (9.15)  that  the  products  of  inertia  sat- 


isfy symmetry  relations  like  Ixy 


lyx- 


The  3x3  matrix  in  (9.17)  is 


therefore  unchanged  by  reflection  in  the  leading  diagonal.  A  tensor  I  with 
this  property  is  called  symmetric. 
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If  the  three  co-ordinate  axes  are  all  axes  of  symmetry,  then  all  the 
products  of  inertia  vanish,  and  I  has  the  diagonal  form 


0  0 

Iyy  0 

o  /„ 


In  this  case,  the  relations  (9.17)  simplify  to 


Jy  ~  lyy^yi 


Jz 


(9.18) 


Tims  J  is  parallel  to  w  if  o>  is  along  any  one  of  the  three  symmetry  axes, 
but  not  in  general  otherwise. 

It  is  shown  in  §A.10  that  for  any  given  symmetric  tensor  one  can  always 
find  a  set  of  axes  with  respect  to  which  it  is  diagonal.  Thus  for  any  rigid 
body,  we  can  find  three  perpendicular  axes  through  any  given  point  which 
are  principal  axes  of  inertia.  It  will  be  convenient  to  introduce  three  unit 
vectors  ei,  en,  e3  along  these  axes.  Then,  if  we  write 


u>  =  wiei  +  uj2e2  +  iv3e3, 


(9.19) 


the  components  of  J  in  these  three  directions  will  be  obtained  by  multi- 
plying the  components  of  lj  by  the  appropriate  moments  of  inertia,  as  in 
(9.18).  Thus  we  obtain 

J  =  hiviei  +  h^2&2  +  h^zez-  (9.20) 


The  three  diagonal  elements  of  the  inertia  tensor,  Ii,h,  h  are  called  prin- 
cipal moments  of  inertia.  We  shall  always  use  a  single  subscript  for  the 
principal  moments,  to  distinguish  them  from  moments  of  inertia  about  ar- 
bitrary axes. 

It  is  important  to  realize  that  the  principal  axes  are  fixed  in  the  body, 
not  in  space,  and  therefore  rotate  with  it.  It  is  often  convenient  to  use 
these  axes  to  define  our  frame  of  reference,  particularly  since  the  principal 
moments  h,h,  h  are  constants.  This  is,  however,  a  rotating  frame,  not  an 
inertial  one. 

The  kinetic  energy  may  also  be  expressed  in  terms  of  the  angular  velocity 
and  the  inertia  tensor.  We  have 
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by  a  standard  formula  of  vector  algebra  (see  Appendix  A,  Problem  7). 
Comparing  with  (9.16),  we  see  that 

T=±w-J.  (9.21) 

(Equations  (9.5)  and  (9.8)  provide  a  special  case  of  this  general  result.) 
Thus  from  (9.19)  and  (9.20),  we  find 

T  =  \hu\  +  \I2loI  +  ±/3c4  (9.22) 

These  equations  for  rotational  motion  may  be  compared  with  the  cor- 
responding ones  for  translational  motion  with  velocity  v,  namely  T  = 
iM«2  =  ^v-P.  The  principal  difference  is  that  the  mass,  unlike  the  inertia 
tensor,  has  no  directional  properties,  so  that  the  coefficients  of  v\,  Vy  and 
v\  are  all  equal. 

Symmetric  bodies 

The  term  symmetric  applied  to  a  rigid  body  has  a  special,  technical  signif- 
icance. It  means  that  two  of  the  principal  moments  of  inertia  coincide,  say 
Ii  =  This  is  true  in  particular  if  the  third  axis  e.%  is  a  symmetry  axis  of 
appropriate  type.  It  may  be  an  axis  of  cylindrical  symmetry  —  for  exam- 
ple, the  axis  of  a  spheroid  or  a  circular  cylinder  or  cone.  More  generally, 
I\  =  I2  if  ez  is  an  axis  of  more  than  two-fold  rotational  symmetry  —  for 
example  the  axis  of  an  equilateral  triangular  prism,  or  a  square  pyramid. 
(In  the  case  of  the  prism,  this  is  not  entirely  obvious.  It  follows  from  the 
fact  that  one  cannot  find  a  pair  of  perpendicular  axes  e.\  and  e2  which  are 
in  any  way  distinguished  from  other  possible  pairs.) 
For  a  symmetric  body,  (9.20)  becomes 

J  =  7i(wiei  +  uj2e2)  +  huj-Aez.  (9.23) 

Note  that  if  ^3  =  0,  then  J  =  Iiu>,  so  J  is  parallel  to  u>.  In  other  words, 
any  axis  in  the  plane  of  e\  and  e2  is  a  principal  axis.  Assuming  that 
I3  7^/1,  the  symmetry  axis  e^  is  of  course  uniquely  determined,  but  the 
axes  ei  and  e2  are  not.  We  may  choose  them  to  be  any  pair  of  orthogonal 
axes  in  the  plane  normal  to  63.  Indeed,  they  need  not  even  be  fixed  in  the 
body,  so  long  as  they  always  remain  perpendicular  to  e%  and  to  each  other. 
Equation  (9.23)  will  still  hold  so  long  as  this  condition  is  satisfied.  This 
freedom  of  choice  will  prove  to  be  very  useful  later. 

It  can  happen  that  all  three  principal  moments  of  inertia  are  equal, 
h  =  h  =  h-  This  is  the  case  for  a  sphere  (with  origin  at  the  centre),  a  cube, 
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or  a  regular  tetrahedron  (or,  indeed,  any  of  the  five  regular  solids).  It  may 
of  course  also  happen  by  accidental  coincidence  of  the  three  values.  We  may 
call  such  bodies  totally  symmetric.  They  have  the  property  that  J  =  I\uj 
whatever  direction  uj  may  take,  so  that  every  axis  is  a  principal  axis,  and 
the  choice  of  the  orthonormal  triad  e±,  e?,  e3  is  completely  arbitrary. 


9.5    Calculation  of  Moments  of  Inertia 


The  moments  and  products  of  inertia  of  any  body  with  respect  to  a  given 
origin  may  be  calculated  from  the  definitions  (9.15).  For  continuous  distri- 
butions of  matter,  we  must  replace  sums  by  integrals: 

hx  =  jjj  P(r)(y2  +  z2)  d3r,       Ixy  =  UJ  p(r)(-xy)  d3r,  (9.24) 

etc.,  where  p(r)  is  the  density. 


Shift  of  origin 

When  the  rigid  body  is  pivoted  so  that  one  point  is  fixed,  it  is  convenient  to 
choose  that  point  to  be  the  origin.  If  there  is  no  fixed  point,  we  generally 
choose  the  origin  to  be  the  centre  of  mass.  Thus  it  is  useful  to  be  able 
to  relate  the  moments  and  products  of  inertia  about  an  arbitrary  point  to 
those  about  the  centre  of  mass. 

As  usual,  we  shall  distinguish  quantities  referred  to  the  centre  of  mass  as 
origin  by  an  asterisk.  To  find  the  desired  relations,  we  substitute  in  (9.15) 
the  relation  (8.16),  r  =  R  +  r*,  and  use  the  conditions  (8.17),  namely 

mx*  =       my*  =       rnz*  =  0. 

Because  of  these  relations,  the  cross  terms  between  R  and  r*  drop  out, 
exactly  as  they  did  in  (8.18)  or  (8.29).  For  example, 

ixy  =  -YJm(x  +  x*)(Y  +  v*)  =  -MXY-J2mx*y*- 

The  last  term  here  is  just  the  product  of  inertia  I*  referred  to  the  centre 
of  mass  as  origin.  Thus  we  obtain  relations  of  the  form 

Ixx  =  M(Y2  +  Z2)  +  Fxx,       Ixy  =  -MXY  +  I*  (9.25) 


etc.  Note  that,  as  in  previous  cases,  the  components  of  the  inertia  tensor 
with  respect  to  an  arbitrary  origin  are  obtained  from  those  with  respect  to 
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the  centre  of  mass  by  adding  the  contribution  of  a  particle  of  mass  M  at 
R.  (For  the  moments  of  inertia,  this  is  known  as  the  parallel  axes  theorem.) 
Because  of  this  result,  it  is  only  necessary,  for  any  given  body,  to  compute 
the  moments  and  products  of  inertia  with  respect  to  the  centre  of  mass; 
those  with  respect  to  any  other  origin  are  then  given  by  (9.25). 

It  is  important  to  realize  that  the  principal  axes  at  a  given  origin  are 
not  necessarily  parallel  to  those  at  the  centre  of  mass.  If  we  choose  the 
axes  at  the  centre  of  mass  to  be  principal  axes,  then  the  products  of  inertia 
I*y,  I*z,  I*z  will  all  be  zero,  but  it  is  clear  from  (9.25)  that  this  does  not 
necessarily  imply  that  Ixy,  Ixz,  Iyz  are  zero.  In  fact,  this  will  be  true  if, 
and  only  if,  the  chosen  origin  lies  on  one  of  the  principal  axes  through  the 
centre  of  mass,  so  that  two  of  the  three  co-ordinates  X,  Y,  Z  are  zero. 


Routh's  rule 

There  is  a  simple  and  convenient  formula  that  allows  us  to  write  down 
at  once  the  moments  of  inertia  of  a  large  class  of  bodies.  We  consider 
bodies  with  uniform  density  p  and  three  perpendicular  symmetry  planes. 
The  principal  axes  are  then  obvious,  and  we  shall  take  them  to  be  the 
co-ordinate  axes. 

From  (9.24),  we  see  that  the  principal  moments  of  inertia  may  be  written 
in  the  form 

I*=Ky  +  Kz,       i;=Kx  +  Kz,       i;=Kx  +  Ky, 
where,  for  example, 

Kz  =  JJJ  pz2  dxdydz, 
integrated  over  the  volume  V  of  the  body.  The  mass  of  the  body  is  of  course 

M  =  /  /  /  p dxdydz. 


J  J  Jv 

Now  let  us  denote  the  lengths  of  the  three  co-ordinate  axes  by  2a,  26,  2c, 
respectively,  and  consider  together  all  bodies  of  the  same  type  {e.g.,  ellip- 
soids, or  rectangular  parallelepipeds)  but  with  different  values  of  a,  b,  c.  It 
is  then  easy  to  find  how  M  and  Kz  depend  on  these  three  lengths.  Making 
the  substitution  x  =  a£,y  =  brj,  z  =  c£,  we  see  that 

M  =  pabc  III  d£d77dC, 
J  J  JVo 
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where  Vq  is  a  standard  body  of  this  type  with  p  =  1  and  a  =  b  =  c  =  1. 
Hence  M  oc  pafoc.  In  a  similar  way,  Kz  oc  pabc3.  It  follows  that  7fz  = 
AzMc2,  where  Az  is  a  dimensionless  number,  the  same  for  all  bodies  of  this 
type.  Hence  we  obtain  Routh's  rule,  which  asserts  that 

/J  =  M(Xyb2  +  Xzc2), 

72*  =  M(Xxa2  +  Azc2),  (9.26) 
i;  =  M(Xxa2  +  Xyb2). 

The  values  of  the  constants  Xx,  Xy,  Az  may  be  found  by  examining  the 
standard  volume  Vo-  For  ellipsoids,  for  example,  Vq  is  a  sphere  of  unit 
radius,  for  which 

K*o  =  Jjjv(2dtidVd(  =  |\Ml-C2)dC=-^. 

Since  also  Mq  =  47r/3,  we  have  Az  =  Kzq>/Mq  =  |.  Thus  for  an  ellipsoid 

Ax  =  Xy  =  Xz  =  -.  (ellipsoid) 
5 

A  similar  calculation  shows  that  for  a  rectangular  parallelepiped 

Ax  =  Aj,  =  Az  =  -.  (parallelepiped) 
o 

As  an  example  of  a  body  for  which  the  three  constants  are  not  all  equal, 
we  may  quote  the  elliptic  cylinder  of  length  2c,  for  which 

Ax  =  Xy  =  -       Xz  =  -.  (cylinder) 

All  three  examples  may  be  summarized  by  saying  that  A  is  1/3  for  'rectan- 
gular' axes,  1/4  for  'elliptic'  ones,  and  1/5  for  'ellipsoidal'  ones. 

This  formula  covers  several  special  cases  of  particular  interest.  A  sphere 
is  of  course  an  ellipsoid  with  a  =  b  =  c,  and  each  of  its  three  principal 
moments  of  inertia  is  |Ma2.  Similarly  a  cube  is  a  parallelepiped  with 
a  =  b  =  c,  and  each  of  its  principal  moments  is  |Ma2.  A  parallelepiped 
with  c  =  0  is  a  flat  rectangular  plate.  Its  principal  moments  are 


7*  =  1-Mb2,  IX  =  ]-Ma2,  U  =  \M(a2  +  b2). 
1      3  2      3  3      3     v  ; 
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Similarly,  a  flat  circular  plate  is  a  cylinder  with  a  =  b  and  c  =  0.  In  that 
case, 

I*  =  I*  =  \Ma2,       K  =  l-Ma2. 
1       2     4  3  2 

Finally,  a  thin  rod  is  a  limiting  case  of  either  a  parallelepiped  or  a  cylinder 
with  a  =  b  =  0.  Its  principal  moments  are 

i*  =  r2  =  Imc2,     j3*  =  o. 


9.6    Effect  of  a  Small  Force  on  the  Axis 

So  far  we  have  assumed  that  the  axis  about  which  the  rigid  body  is  rotating 
is  fixed.  We  now  go  on  to  consider  how  it  moves  if  only  one  point  on  the 
axis  is  fixed.  As  we  shall  see,  rapidly  rotating  bodies  have  great  stability, 
which  forms  the  basis,  for  example,  of  the  gyroscope. 

We  suppose  that  the  rigid  body  is  free  to  rotate  about  a  fixed  smooth 
pivot,  and  that  initially  it  is  rotating  freely  about  a  principal  axis,  say  ea. 
Then,  if  the  angular  velocity  is  u;  =  Loe^,  the  angular  momentum  will  be 
J  =  I'scoe^.  So  long  as  no  external  force  acts  on  the  body  the  angular 
momentum  equation  (9.2)  tells  us  that 

j  =  I3u  =  0, 

so  the  axis  will  remain  fixed  in  space  and  the  angular  velocity  will  be 
constant.  (This  would  not  be  true  if  the  axis  of  rotation  were  not  a  principal 
axis.  We  shall  see  what  happens  in  that  case  later.) 

Now  suppose  that  a  small  force  F  is  applied  to  the  axis  at  a  point  r 
(see  Fig.  9.5).  Then  the  equation  of  motion  becomes 

J  =  rAF.  (9.27) 

This  force  will  cause  the  axis  to  change  direction,  and  the  body  will  there- 
fore acquire  a  small  component  of  angular  velocity  perpendicular  to  its  axis 
ez-  However,  provided  the  force  is  small  enough,  the  angular  velocity  with 
which  the  axis  moves  will  be  small  in  comparison  to  the  angular  veloc- 
ity of  rotation  about  the  axis.  In  that  case,  we  may  neglect  the  angular 
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Fig.  9.5 

momentum  components  normal  to  the  axis,  and  again  write 

j  =        =  r  A  F.  (9.28) 

Since  the  moment  r  AF  is  perpendicular  to  u>,  the  magnitude  of  the  angular 
velocity  does  not  change.  (Recall  that  d(u;2)/d£  =  2w  •  w.)  However,  its 
direction  does  change,  and  the  axis  will  move  in  the  direction  of  r  A  F,  that 
is  perpendicular  to  the  applied  force. 

The  effect  may  be  rather  surprising  at  first  sight.  However,  it  is  easy  to 
see  it  in  action  in,  for  example,  a  freely  rolling  bicycle  wheel  (see  Fig.  9.6). 
If  the  wheel  leans  to  the  right,  it  will  not  necessarily  fall  over,  as  it  would 
if  it  were  not  rotating,  but  will  turn  to  the  right.  The  force  F  in  this  case 
is  the  force  of  gravity,  which  produces  a  moment  about  a  horizontal  axis, 
as  shown  in  Fig.  9.6.  Thus,  in  a  small  time  interval  dt,  the  angular  velocity 
w  of  a  wheel  acquires  a  small  additional  horizontal  component,  do>.  This 
shows  that  the  axis  of  the  wheel  must  change  direction. 

When  turning  a  corner  on  a  bicycle,  this  effect  plays  a  role,  but  there  are 
other  more  important  effects,  related  to  the  way  the  handlebars  and  front 
fork  are  pivoted.  (See  Jones,  Physics  Today,  23,  34-40,  1970  and  Fajans, 
American  Journal  of  Physics,  68,  654-659,  2000.) 

Another  familiar  example  is  the  spinning  top,  or  toy  gyroscope  (which 
we  shall  discuss  in  more  detail  in  Chapters  10  and  12).  The  force  in  this 
case  is  again  the  force  of  gravity,  F  =  —Mgk,  acting  at  the  centre  of  mass 
position,  R  =  Re3  (see  Fig.  9.7).  Thus  (9.28)  yields 
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Fig.  9.7 

Isujes  =  -MgRe:i  A  k. 

This  equation  is  of  a  form  we  have  encountered  several  times  already.  It 
may  be  written 


e3  =  ft  A  e3, 
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where 

n  =  ^k 

Tims  (see  §5.1)  it  describes  a  vector  e%  rotating  with  constant  angular 
velocity  fi  about  the  vertical.  The  effect  is  similar  to  the  Larmor  precession 
(see  §5.6)  or  the  precession  of  a  satellite  orbit  (see  §6.5).  Note  that  the 
angular  velocity  ft  is  independent  of  the  inclination  of  the  axis. 

Our  treatment  here  is  valid  only  if  f2  <C  u),  or,  equivalently,  if  MgR  <C 
I3UJ2.  In  other  words,  we  require  the  rotational  kinetic  energy  (T  =  ^I^uj2) 
to  be  much  greater  than  the  possible  change  in  gravitational  potential  en- 
ergy (of  order  MgR) .  We  shall  discuss  what  happens  when  this  condition 
is  not  satisfied  later  (see  §10.3  and  §12.4). 

Note  that  the  precessional  angular  velocity  (9.29)  is  inversely  propor- 
tional both  to  the  moment  of  inertia  and  the  rotational  angular  velocity  u. 
Tims,  to  minimize  the  effect  of  a  given  force,  we  should  use  a  fat,  rapidly 
spinning  body. 

The  great  stability  of  rapidly  rotating  bodies  is  the  basis  of  the  gyro- 
scope. Essentially,  this  consists  of  a  spinning  body  suspended  in  such  a  way 
that  its  axis  is  free  to  rotate  relative  to  its  support.  The  bearings  are  made 
as  nearly  frictionless  as  possible,  to  minimize  the  torques  on  the  gyroscope. 
Then,  no  matter  how  we  turn  the  support,  the  axis  of  the  gyroscope  will 
remain  pointing  very  closely  to  the  same  direction  in  space. 

The  gyroscope  is  particularly  useful  for  navigational  instruments  in  air- 
craft. It  may  be  employed,  for  example,  to  provide  an  'artificial  horizon' 
which  allows  the  pilot  to  fly  a  level  course  even  in  cloud  (where  our  normal 
sense  of  balance  is  notoriously  unreliable).  A  similar  instrument,  designed 
to  show  the  orientation  in  a  horizontal  plane,  serves  as  a  direction  indicator. 

Precession  of  the  equinoxes 

The  non-spherical  shape  of  the  Earth  leads  to  another  example  of  the  phe- 
nomenon of  precession.  The  effect  is  closely  related  to  the  precession  of  a 
satellite  orbit  discussed  in  §6.5.  Since  the  Earth  exerts  a  moment  on  the 
satellite,  the  satellite  exerts  an  equal  and  opposite  moment  on  the  Earth. 
For  a  small  satellite,  this  is  of  course  quite  negligible,  but  a  similar  moment 
is  exerted  by  both  the  Sun  and  the  Moon,  and,  although  the  effect  is  still 
quite  small,  it  leads  to  observable  changes  over  long  periods  of  time. 


(9.29) 
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Let  us  consider  for  example  the  effect  of  the  Sun.  Because  it  attracts 
the  nearer  equatorial  bulge  more  strongly  that  the  farther  one,  it  exerts  a 
couple  on  the  Earth's  axis,  in  a  direction  which  would  tend  (if  the  Earth 
were  not  rotating)  to  align  it  with  the  normal  to  the  plane  of  the  Earth's 
orbit  (called  the  ecliptic  plane,  because  it  is  the  plane  in  which  eclipses 
can  occur.)  (See  Fig.  9.8.)  However,  because  of  the  Earth's  rotation,  the 


Sun 


Fig.  9.8 


actual  effect  is  to  make  the  axis  precess  around  the  normal  to  the  ecliptic 
plane.  Since  the  equator  is  inclined  to  the  ecliptic  at  about  23.45°,  the  axis 
therefore  describes  a  cone  in  space  with  semi- vertical  angle  23.45°. 

It  turns  out  that  this  effect  depends  on  the  same  parameter,  (6.41)  or 
(6.42),  that  appeared  in  the  discussion  of  the  tides.  Thus,  as  in  that  case, 
the  effect  of  the  Moon  is  rather  more  than  twice  as  large  as  that  of  the  Sun 
(though  somewhat  complicated  by  the  fact  that  the  Moon's  orbital  plane  is 
not  fixed  in  space).  The  two  bodies  together  lead  to  a  precessional  angular 
velocity  of  about  50"  per  year,  which  corresponds  to  a  precessional  period 
of  26  000  years.  The  effect  means  that  the  pole  of  the  Earth's  axis  moves 
relative  to  the  fixed  stars,  so  that  13  000  years  ago  our  present  pole  star 
was  some  47°  away  from  the  pole.  (The  irregularity  of  the  Moon's  motion, 
which  is  due  to  the  moment  exerted  by  the  Sun  on  the  Earth-Moon  system, 
leads  to  a  superimposed  wobble  of  the  axis  with  an  amplitude  of  9"  and  a 
period  of  18.6  years.) 

The  equinoxes  occur  when  the  apparent  direction  of  the  Sun  crosses  the 
equator.  The  points  on  the  ecliptic  where  this  happens  also  precess  with  the 
26  000-year  period,  which  is  why  this  phenomenon  is  known  as  'precession 
of  the  equinoxes'. 
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9.7    Instantaneous  Angular  Velocity 

We  shall  now  turn  to  the  problem  of  the  general  motion  of  a  rigid  body, 
without  assuming  that  it  is  spinning  about  a  principal  axis,  or  that  the 
forces  acting  are  small.  In  this  section,  we  shall  discuss  the  concept  of 
angular  velocity,  and  show  that,  no  matter  how  the  body  is  moving,  it  is 
always  possible  to  define  an  instantaneous  angular  velocity  vector  w  (which 
in  general  is  constant  in  neither  direction  nor  magnitude). 

We  consider  first  a  rigid  body  free  to  rotate  about  a  fixed  pivot.  Then 
the  position  of  every  point  of  the  body  is  fixed  if  we  specify  the  directions 
of  the  three  principal  axes  e±,  e2,  e3.  (This  would  be  true  for  any  three  axes 
fixed  in  the  body,  but  it  will  be  convenient  later  if  we  choose  them  to  be 
principal  axes.) 

Since  the  position  of  any  point  of  the  body  is  fixed  relative  to  these 
axes,  its  velocity  is  determined  by  the  three  velocities  ei,  e2,  e3.  In  fact,  if 
the  point  in  question  is 

r  =  r1ei+r2e2  +  r3e:i,  (9.30) 

then  r\,r2,r3  are  constants,  and 

r  =  r1e1+r2e2  +  r3e3.  (9.31) 

We  now  wish  to  show  that,  no  matter  how  the  body  is  moving,  we  can 
always  define  an  instantaneous  angular  velocity  u>.  Let  us  define  the  nine 
quantities 

Now,  since  ei  is  a  unit  vector,  e\  =  1.  Hence,  differentiating,  we  obtain 

0  =  ^(e?)  =  2ei  •  ei  =  2on. 
It  follows  that  k\  has  the  form 

ex  =  a21e2  +  a31e3.  (9.32) 
Moreover,  since  e\  and  e2  are  always  perpendicular,  we  have 


0  =  — (ei  •  e2)  =  ei  •  e2  +  e1  ■  e2  =  a21  +  a12. 
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Thus  we  can  define  a  vector  u>  by  setting 


^1  —  — «23  —  «32, 


k->2  —  —  «31  —  013) 


(9.33) 


Ll>3  =  — 012  =  021- 


With  this  definition,  (9.32)  becomes  simply 


ei  =  u;3e2  -  a;2e3  =  w  A  ei, 


(9.34) 


where  of  course 


w  =  Wiei  +  w2e2  +  a>3e3 


is  the  instantaneous  angular  velocity.  Substituting  this,  and  two  similar 
equations,  into  (9.31),  we  find  that  the  velocity  of  any  point  in  the  body  is 
given  by 


Thus  we  have  shown  that  an  instantaneous  angular  velocity  vector  uj  always 
exists.  We  note  that  the  direction  of  u>  is  the  instantaneous  axis  of  rotation: 
for  points  r  lying  on  this  axis,  the  instantaneous  velocity  r  is  zero. 

Now  let  us  consider  the  general  case,  in  which  no  point  of  the  body  is 
permanently  fixed.  Then  we  may  specify  the  position  of  the  body  by  the 
position  R  of  its  centre  of  mass  (or  of  any  other  designated  point),  and  by 
the  orientations  of  the  principal  axes  at  that  point.  We  can  now  define,  as 
before,  an  angular  velocity  vector  uj  relative  to  the  centre  of  mass.  If  R 
is  the  velocity  of  the  centre  of  mass,  then  the  instantaneous  velocity  of  a 
point  whose  position  relative  to  the  centre  of  mass  is  r*  is 


But,  since  r*  is  fixed  in  the  body,  we  can  apply  the  same  argument  as  above 
to  define  an  instantaneous  angular  velocity  vector  uj  and  write 


We  have  not  attached  an  asterisk  to  oj,  because  the  angular  velocity  is  in 
fact  independent  of  the  choice  of  origin.  For  example,  if  the  body  is  rotating 
with  angular  velocity  uj  about  a  fixed  pivot,  then 


f=uAr. 


(9.35) 


r  =  R  +  r*. 


r  =  R  +  oj  A  r*. 


(9.36) 


r  =  ojAr  =  ujAR  +  ojAr*. 
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But  the  instantaneous  velocity  of  the  centre  of  mass  is  of  course  R  =  wAfl, 
whence  subtracting  from  (9.36),  we  find 

r*  =  w  A  r*, 

which  shows  that  u>  is  also  the  angular  velocity  relative  to  the  centre  of 
mass. 

The  equation  of  motion  of  a  rigid  body  rotating  about  a  fixed  pivot  is 
the  angular  momentum  equation  (9.2), 

j  =  J2rAF,  (9.37) 

where  J  is  related  to  u>  by  (9.20): 

J  =  hwiei  +  h^2&2  +  h^ze-i. 

As  in  §9.2,  the  momentum  equation  (9.1)  serves  to  determine  the  reaction 
at  the  pivot. 

In  the  general  case,  we  have  to  determine  both  the  velocity  of  the  centre 
of  mass  and  the  angular  velocity.  The  motion  of  the  centre  of  mass  is 
described  by  the  momentum  equation  (9.1): 

p  =  MR  =  J2F,  (9-38) 

and  the  rotational  motion  by 

j*=Y^r*AF.  (9.39) 

The  angular  momentum  J*  about  the  centre  of  mass  is  of  course  related  to 
the  angular  velocity  by 

=  iiWiej  +  /2w2e2  +  h^>ze:i, 

where  e*,  e2,  are  the  principal  axes  at  the  centre  of  mass,  and  /*,  /2 ,  I£ 
are  the  corresponding  principal  moments. 

9.8    Rotation  about  a  Principal  Axis 

The  principal  axes  ei,  e2,  rotate  with  the  body.  Thus  if  we  wish  to  use 
directly  the  expression  for  J  in  terms  of  its  components  with  respect  to 
these  axes,  we  have  to  remember  that  they  constitute  a  rotating  frame.  In 
this  section,  we  shall  return  to  the  notation  of  Chapter  5,  and  distinguish 
the  absolute  rate  of  change  dJ/dt  from  the  relative  rate  of  change  J.  The 
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equation  of  motion  (9.37)  refers  of  course  to  the  absolute  rate  of  change,  so 
we  must  write  it  as 

C±L  =  J2rAF  =  G,  (9.40) 

say.  (The  argument  here  may  be  applied  equally  to  a  rigid  body  rotating 
about  a  fixed  pivot,  or  to  the  rotation  about  the  centre  of  mass.) 
The  relative  rate  of  change  is 

J  =  hwiei  +  I26j2e2  +  h^se-j, 

since  the  principal  moments  of  inertia  are  constants.  The  two  rates  of 
change  are  related  by 

dJ  . 

—  =  J  +  u>  A  J. 

at 

Substituting  in  (9.40),  we  obtain 

j  +  wAJ  =  G, 

or,  in  terms  of  components, 


hill!  +  (J3 

-h. 

)uJ2U>Z 

=  Gi, 

I2U)2  +  (h 

-h. 

)w3wi 

=  G2, 

(9.41) 

I3W3  +  (I2 

-h. 

=  G3. 

These  equations  may  be  solved,  in  principle,  to  give  the  angular  velocity 
components  as  functions  of  time.  However,  when  there  are  external  forces, 
they  are  not  often  particularly  useful,  because  these  forces  are  usually  spec- 
ified in  terms  of  their  components  with  respect  to  a  given  fixed  set  of  axes. 
Even  if  the  external  force  F  is  a  constant,  its  components  Fi,F2,F3  are 
variable,  and  depend  on  the  unknown  orientation  of  the  body.  For  this  kind 
of  problem,  an  alternative  method  of  solution,  using  Lagrange's  equations 
(to  be  described  in  the  next  chapter),  is  usually  preferable. 

We  shall  confine  our  discussion  here  to  the  case  where  there  are  no 
external  forces,  so  that  the  right  hand  sides  of  (9.41)  vanish.  If  the  body  is 
initially  rotating  about  the  principal  axis  e3,  so  that  u>\  =  u)2  =  0,  then  we 
see  from  (9.41)  that  they  remain  zero,  and  uj-j,  is  a  constant.  Thus  we  have 
verified  our  earlier  assertion  that  a  body  rotating  freely  about  a  principal 
axis  will  continue  to  rotate  with  constant  angular  velocity.  Now,  however, 
we  wish  to  investigate  the  stability  of  this  motion. 
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Example:  Stability  of  free  rotation  about  a  principal  axis 

A  rigid  body  is  rotating  freely  about  a  principal  axis,  say  e^, 
and  is  given  a  small  displacement.  Will  its  rotation  axis  remain 
close  to  e3? 

If  the  displacement  is  small  enough,  we  may  treat  u>\  and  u>2  (at 
least  initially)  as  small,  and  neglect  the  product  cuiu)2-  Thus  the  third 
equation,  with  G3  =  0,  tells  us  that  uj-i,  remains  almost  constant.  We 
can  solve  the  remaining  two  equations  (with  G\  =  G2  =  0)  by  looking 
for  solutions  of  the  form 

uji  =  a±ept,        L02  =  a,2ept, 

where  a\,  C12  andp  are  constants.  (This  is  essentially  the  method  we  used 
to  treat  the  the  problem  of  stability  of  equilibrium  in  one  dimension  in 
§§2.2,2.3.  We  shall  discuss  the  general  problem  of  stability  by  a  similar 
method  in  the  next  chapter.)  Substituting,  and  dividing  by  ept,  we  obtain 

hpai  +  (I3  ~  ^2)^3^2  =  0, 
I2pa2  +  (h  -  h^ax  =  0. 

Eliminating  the  ratio  ai/tt2,  we  find 

o  _  (h  -  h){h  -  h)  :  2 

J1J2 

The  denominator  is  obviously  positive,  as  is  w|.  Thus  if  I3  >  I±  and 
^3  >  h,  or  if  13  <  Ii  and  ^3  <  h,  the  two  roots  for  p  are  pure  imaginary, 
and  we  have  an  oscillatory  solution.  On  the  other  hand,  if  I\  >  I3  >  h 
or  Ii  <  I3  <  h,  then  the  roots  are  real,  and  the  values  of  lui  and  LU2  will 
in  general  increase  exponentially  with  time,  until  the  approximation  of 
treating  them  as  small  is  no  longer  valid. 

Therefore,  we  may  conclude  that  the  rotation  about  the  axis  es  is 
stable  if  ^3  is  either  the  largest  or  the  smallest  of  the  three  principal 
moments,  but  not  if  it  is  the  middle  one. 

This  interesting  result  is  easy  to  verify.  For  example,  if  one  throws  a 
matchbox  in  the  air,  it  is  not  hard  to  get  it  to  spin  about  its  longest  or  its 
shortest  axis,  but  it  will  not  spin  stably  about  the  other  one.  The  result 
applies  to  any  rigid  body.  It  is  often  called  the  tennis  racquet  theorem,  and 
we  will  examine  this  problem  from  a  different  perspective  in  §13.6. 
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9.9    Euler's  Angles 

The  orientation  of  a  rigid  body  about  a  fixed  point  —  or  about  its  centre  of 
mass  —  must  be  specified  by  three  angles.  These  may  be  chosen  in  various 
ways,  but  one  convenient  choice  is  the  set  of  angles  known  as  Euler's  angles. 
Two  of  these  are  required  to  fix  the  direction  of  one  of  the  axes,  say  e3  — 
they  are  just  the  polar  angles  9,  ip  —  and  the  third  specifies  the  angle 
through  which  the  body  has  been  rotated  from  a  standard  position  about 
this  axis. 

We  can  reach  an  arbitrary  orientation  by  starting  with  the  body  in  a 
standard  position,  and  making  three  successive  rotations.  Let  us  suppose 
that  initially  the  axes  ei,e2,e^  of  the  body  coincide  with  the  fixed  axes 
i,j,k.   (See  Fig.  9.9.)  The  first  two  rotations  are  designed  to  bring  the 


Fig.  9.9 

axis  e-j,  to  its  required  position,  specified  by  the  polar  angles  9,  (p.  First, 
we  make  a  rotation  through  an  angle  ip  about  the  axis  k.  This  brings  the 
three  axes  into  the  positions  e1,e'2,k.  Next,  we  make  a  rotation  through 
an  angle  9  about  the  axis  e'2,  bringing  the  axes  to  the  positions  e'^e'^e^. 
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Finally,  we  make  a  rotation  through  an  angle  ip  about  e$.  This  brings  all 
three  axes  to  their  required  positions  e\,  e2,  e3. 

Since  the  three  Euler  angles,  <p,9,ip,  fix  the  orientations  of  the  three 
axes,  they  specify  completely  the  orientation  of  the  rigid  body.  The  angular 
velocity  of  the  body  is  clearly  determined  by  the  rates  of  change  of  these 
three  angles.  A  small  change  in  ip  corresponds  to  a  rotation  about  k.  Hence 
if  ip  is  changing  at  a  rate  ip,  and  the  other  angles  are  constant,  the  angular- 
velocity  is  ipk.  Similarly,  if  only  8  or  ip  is  changing,  the  angular  velocity  is 
9e'2  or  ipe3-  When  all  three  angles  are  changing,  the  angular  velocity  is  the 
sum  of  these  three  contributions, 

w  =  (pk  +  9e'2  +  ipe3.  (9.42) 

In  order  to  find  the  angular  momentum  or  kinetic  energy,  using  (9.20) 
or  (9.22),  we  should  have  to  find  the  components  of  u>  in  the  directions 
of  the  three  principal  axes,  ei,e2,e3.  This  is  not  difficult,  but  we  shall 
not  go  through  the  derivation  in  the  general  case.  Instead,  we  limit  the 
discussion  to  the  case  of  a  symmetric  body,  with  symmetry  axis  e^,  and 
Ii  =Ii-  Then,  according  to  the  discussion  of  §9.4,  any  orthogonal  pair  of 
axes  in  the  plane  of  ei  and  e2  will  serve  as  principal  axes,  together  with 
e3.  It  is  not  necessary  that  the  axes  we  choose  be  fixed  in  the  body,  so 
long  as  they  are  always  principal  axes.  In  particular,  we  may  use  the  axes 
e'j,  e2.  This  choice  has  the  advantage  that  two  of  the  three  terms  in  (9.44) 
are  already  expressed  in  terms  of  these  axes  e'l5  e'2,e3. 

The  components  of  k  in  the  directions  of  e[,e'2,e-i  are  clearly 
(—  sin  8, 0,  cos  8),  so  that 

k  =  -sintfel  +cos0e3.  (9.43) 

Thus  from  (9.42),  we  find 

uj  =  -(psm8e[  +  8e2  +  (ip  +  (pcos8)e3.  (9.44) 

We  can  now  find  the  angular  momentum  from  (9.20).  All  we  have  to  do 
is  to  multiply  the  three  components  of  ui  by  the  principal  moments  Ii,Ii,I3. 
We  obtain 

J  =  -Ilipsm8e'1+I1de'2  +  I3(ip  +  <pcos8)e3.  (9.45) 

To  find  the  equations  of  motion,  we  could  either  re-express  J  in  terms  of 
the  fixed  axes  i,j,k,  or  write  down  equations  directly  in  the  rotating  frame 
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defined  by  e[,  e'2,ez-  We  shall,  however,  find  a  simpler  method  in  the  next 
chapter. 

The  kinetic  energy  is  obtained  similarly  from  (9.22).  It  is 

T=  i/iv92sin26>+  \h62  +  \lz{i) +  <p  cos  6)2.  (9.46) 


Free  motion  of  a  symmetric  rigid  body 

As  an  example  of  the  use  of  the  Euler  angles,  let  us  consider  a  symmetric 
body  moving  under  no  forces.  Its  centre  of  mass  moves  of  course  with 
uniform  velocity.  The  interesting  part  of  the  motion  is  the  rotation  about 
the  centre  of  mass.  The  same  formalism  applies  to  a  body  rotating  freely 
about  a  fixed  pivot. 

Since  there  are  no  external  forces,  the  angular  momentum  equation  is 
simply  dj /At  =  0.  The  angular  momentum  vector  therefore  points  in 
a  fixed  direction  in  space.  Though  we  could  solve  the  problem  in  terms 
of  arbitrary  axes,  it  will  be  convenient  to  choose  the  axis  k  to  be  in  the 
direction  of  the  constant  vector  J.  Then,  by  (9.43), 

J  =  Jk  =  —  Jsm9e[  +  J  cos  9  e-j. 

For  consistency,  this  must  be  identically  equal  to  the  expression  (9.45). 
Hence,  equating  the  three  components,  we  obtain 

IitpsinO  =  Js'mO, 

ij  =  0,  (9.47) 
-Z3  (ip  +  if  cos  9)  =  J  cos  9. 

From  the  second  equation,  we  learn  that  9  is  a  constant,  and  from  the  other 
two  that  (provided  9  ^  0  or  it)  (p  and  ip  are  constants.  Hence  the  axis  of  the 
body,  ez  rotates  around  the  direction  of  J  at  a  constant  rate  ip,  maintaining 
a  constant  angle  9  to  it,  and  in  addition  the  body  spins  about  its  axis  with 
constant  angular  velocity  ip  (see  Fig  9.10). 
The  angular  velocity  vector 

u  = -ip sine  e[  +  (ip  +  <p  cos  6)e3  (9.48) 

has  fixed  components  with  respect  to  e[,  e'2,ez-  It  too  therefore  maintains 
a  constant  angle  with  ez,  and  also  with  J.  In  space,  the  angular  velocity 
vector  lj  thus  describes  a  cone  around  the  direction  of  J,  precessing  at  the 
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Fig.  9.10 


rate  ip.  This  is  known  as  the  space  cone.  In  the  body,  since  oj  maintains  a 
constant  angle  to  e-j,  it  must  describe  a  cone  around  this  axis  —  the  body 
cone.  The  rate  at  which  it  describes  this  body  cone  is  ip.  (For,  uj  is  fixed 
with  respect  to  the  axes  e'x,e'2,e.z,  and  the  body  rotates  relative  to  them 
with  angular  velocity  ip.) 

Since  the  direction  of  ui  is  the  instantaneous  axis  of  rotation,  the  body 
cone  is  instantaneously  at  rest  along  the  line  in  the  direction  of  u>,  which  is 
where  it  touches  the  space  cone.  (See  Fig.  9.10.)  Thus  the  motion  may  be 
described  by  saying  that  the  body  cone  rolls  round  the  fixed  space  cone. 

From  (9.48)  we  see  that  the  semi-vertical  angle  f3  of  the  body  cone  is 
given  by 


tan/3 


uii  ip  sin  6  1% 
^3      ip  +  (fi  cos  9  h 


tan#, 


(9.49) 


by  (9.47).  In  particular,  if  the  body  is  nearly  totally  symmetric,  so  that 
\I\  —  H\  ^3 i  then  the  angles  (3  and  6  are  nearly  equal,  and  the  space  cone 
is  much  narrower  than  the  body  cone.  In  this  case,  it  is  easy  to  see  that 
^  «  i(3,  so  that  the  period  taken  to  describe  the  body  cone  is  much  greater 
than  that  for  the  space  cone. 

The  Earth's  axis  has  a  very  small  oscillation  of  this  kind,  called  the 
Chandler  wobble.  It  precesses  around  a  fixed  direction  in  space  roughly  once 
every  day,  and  moves  round  a  cone  in  the  Earth  with  semi-vertical  angle 
of  order  0.1"  (a  circle  of  about  3  m  radius  at  the  pole)  in  a  period  which 
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should  be  about  300  days  for  a  perfectly  rigid  Earth,  but  is  actually  around 
14  months,  with  considerable  variation.  In  this  case  I3  >  ii,  and  therefore 
(3  >  8,  so  the  body  cone  actually  rolls  around  with  the  space  cone  inside  it, 
and  ifi  <  0  (see  Fig.  9.11).  (Note  that  this  precession  is  quite  independent 


Fig.  9.11 

of  the  much  larger  effect,  the  precession  of  the  equinoxes,  discussed  at  the 
end  of  §9.6). 

9.10  Summary 

Determining  the  motion  of  a  rigid  body  under  known  forces  can  be  divided 
into  two  quite  separate  problems.  The  motion  of  the  centre  of  mass  is 
determined,  via  the  momentum  equation  (9.1),  by  the  total  force  acting 
on  the  body  —  or,  if  one  point  is  fixed,  the  force  is  determined  by  the 
motion  of  the  centre  of  mass.  The  rotational  motion  about  the  centre  of 
mass  is  determined,  via  the  angular  momentum  equation  (9.2),  by  the  total 
moment  of  the  forces  acting. 

The  relation  between  angular  momentum  and  angular  velocity  is  de- 
scribed by  the  inertia  tensor.  It  is  always  possible  to  find  a  set  of  principal 
axes  (fixed  in  the  body)  for  which  all  the  products  of  inertia  vanish.  Then 
J  is  obtained  from  uj  simply  by  multiplying  each  component  by  the  appro- 
priate principal  moment  of  inertia.  Similarly  the  kinetic  energy  T  is  the 
sum  of  the  three  terms  corresponding  to  rotation  about  each  of  the  three 
principal  axes. 
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In  general,  it  is  convenient  to  describe  the  rotational  motion  in  terms  of 
a  set  of  principal  axes.  Normally  these  rotate  with  the  body,  though  in  the 
special  case  of  a  symmetric  rigid  body  (with  I\  =12)  they  may  be  chosen  as 
in  §9.9.  In  that  case,  the  orientation  of  the  body  is  conveniently  described 
by  Euler's  angles.  We  shall  see  in  the  following  chapter  that  the  Lagrangian 
method  is  very  useful  for  obtaining  equations  of  motion  in  terms  of  Euler's 
angles. 


Problems 

1.  A  uniform  solid  cube  of  edge  length  2a  is  suspended  from  a  horizontal 
axis  along  one  edge.  Find  the  length  of  the  equivalent  simple  pendulum. 
Given  that  the  cube  is  released  from  rest  with  its  centre  of  mass  level 
with  the  axis,  find  its  angular  velocity  when  it  reaches  the  lowest  point. 

2.  An  insect  of  mass  100  mg  is  resting  on  the  edge  of  a  flat  uniform  disc 
of  mass  3g  and  radius  50  mm,  which  is  rotating  at  60  r.p.m.  about 
a  smooth  pivot.  The  insect  crawls  in  towards  the  centre  of  the  disc. 
Find  the  angular  velocity  when  it  reaches  it,  and  the  gain  in  kinetic 
energy.  Where  does  this  kinetic  energy  come  from,  and  what  happens 
to  it  when  the  insect  crawls  back  out  to  the  edge? 

3.  A  uniform  solid  cube  of  edge  2a  is  sliding  with  velocity  v  on  a  smooth 
horizontal  table  when  its  leading  edge  is  suddenly  brought  to  rest  by 
a  small  ridge  on  the  table.  Which  dynamical  variables  are  conserved 
(a)  before  impact,  (b)  during  impact,  and  (c)  after  impact?  Find  the 
angular  velocity  immediately  after  impact,  and  the  fractional  loss  of 
kinetic  energy.  Determine  the  minimum  value  of  v  for  which  the  cube 
topples  over  rather  than  falling  back. 

4.  A  pendulum  consists  of  a  light  rigid  rod  of  length  250  mm,  with  two 
identical  uniform  solid  spheres  of  radius  50  mm  attached  one  on  either 
side  of  its  lower  end.  Find  the  period  of  small  oscillations  (a)  perpen- 
dicular to  the  line  of  centres,  and  (b)  along  it. 

5.  A  uniform  rod  of  mass  M  and  length  2a  hangs  from  a  smooth  hinge 
at  one  end.  Find  the  length  of  the  equivalent  simple  pendulum.  It 
is  struck  sharply,  with  impulse  X,  at  a  point  a  distance  b  below  the 
hinge.  Use  the  angular  momentum  equation  to  find  the  initial  value  of 
the  angular  velocity.  Find  also  the  initial  momentum.  Determine  the 
point  at  which  the  rod  may  be  struck  without  producing  any  impulsive 
reaction  at  the  hinge.  Show  that,  if  the  rod  is  struck  elsewhere,  the 
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direction  of  the  impulsive  reaction  depends  on  whether  the  point  of 
impact  is  above  or  below  this  point. 

6.  *(a)  A  simple  pendulum  supported  by  a  light  rigid  rod  of  length  I  is 
released  from  rest  with  the  rod  horizontal.  Find  the  reaction  at  the 
pivot  as  a  function  of  the  angle  of  inclination. 

(b)  For  the  cube  of  Problem  1,  find  the  horizontal  and  vertical  compo- 
nents of  the  reaction  on  the  axis  as  a  function  of  its  angular  position. 
Compare  your  answer  with  the  corresponding  expressions  for  the  equiv- 
alent simple  pendulum. 

7.  *Find  the  principal  moments  of  inertia  of  a  flat  rectangular  plate  of 
mass  30  g  and  dimensions  80mmx60mm.  Given  that  the  plate  is  ro- 
tating about  a  diagonal  with  angular  velocity  15rads_1,  find  the  com- 
ponents of  the  angular  momentum  parallel  to  the  edges.  Given  that 
the  axis  is  of  total  length  120  mm,  and  is  held  vertical  by  bearings  at 
its  ends,  find  the  horizontal  component  of  the  force  on  each  bearing. 

8.  Find  the  principal  moments  of  inertia  of  a  uniform  solid  cube  of  mass 
to  and  edge  length  2a  (a)  with  respect  to  the  mid-point  of  an  edge,  and 
(b)  with  respect  to  a  vertex. 

9.  Find  the  moment  of  inertia  about  an  axis  through  its  centre  of  a  uniform 
hollow  sphere  of  mass  M  and  outer  and  inner  radii  a  and  b.  (Hint: 
Think  of  it  as  a  sphere  of  density  p  and  radius  a,  with  a  sphere  of 
density  p  and  radius  b  removed.) 

10.  A  spaceship  of  mass  3t  has  the  form  of  a  hollow  sphere,  with  inner 
radius  2.5  m  and  outer  radius  3  m.  Its  orientation  in  space  is  controlled 
by  a  uniform  circular  flywheel  of  mass  10  kg  and  radius  0.1m.  Given 
that  the  flywheel  is  set  spinning  at  2000  r.p.m.,  find  how  long  it  takes 
the  spaceship  to  rotate  through  1°.  Find  also  the  energy  dissipated  in 
this  manoeuvre. 

11.  A  long,  thin  hollow  cylinder  of  radius  a  is  balanced  on  a  horizontal 
knife  edge,  with  its  axis  parallel  to  it.  It  is  given  a  small  displacement. 
Calculate  the  angular  displacement  at  the  moment  when  the  cylinder 
ceases  to  touch  the  knife  edge.  (Hint:  This  is  the  moment  when  the 
radial  component  of  the  reaction  falls  to  zero.) 

12.  *Calculate  the  principal  moments  of  inertia  of  a  uniform,  solid  cone  of 
vertical  height  h,  and  base  radius  a  about  its  vertex.  For  what  value  of 
the  ratio  h/a  is  every  axis  through  the  vertex  a  principal  axis?  For  this 
case,  find  the  position  of  the  centre  of  mass  and  the  principal  moments 
of  inertia  about  it. 
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13.  A  top  consists  of  a  uniform,  solid  cone  of  height  50  mm  and  base  radius 
20  mm.  It  is  spinning  with  its  vertex  fixed  at  7200  r.p.m.  Find  the 
precessional  period  of  the  axis  about  the  vertical. 

14.  A  gyroscope  consisting  of  a  uniform  solid  sphere  of  radius  0.1  m  is  spin- 
ning at  3000  r.p.m.  about  a  horizontal  axis.  Due  to  faulty  construction, 
the  fixed  point  is  not  precisely  at  the  centre,  but  20  fim  away  from  it 
along  the  axis.  Find  the  time  taken  for  the  axis  to  move  through  1°. 

15.  A  gyroscope  consisting  of  a  uniform  circular  disc  of  mass  100  g  and 
radius  40  mm  is  pivoted  so  that  its  centre  of  mass  is  fixed,  and  is  spin- 
ning about  its  axis  at  2400  r.p.m.  A  5g  mass  is  attached  to  the  axis 
at  a  distance  of  100  mm  from  the  centre.  Find  the  angular  velocity  of 
precession  of  the  axis. 

16.  *A  uniformly  charged  sphere  is  spinning  freely  with  angular  velocity  u> 
in  a  uniform  magnetic  field  B.  Taking  the  z  axis  in  the  direction  of  a;, 
and  B  in  the  a:z-plane,  write  down  the  moment  about  the  centre  of  the 
magnetic  force  on  a  particle  at  r.  Evaluate  the  total  moment  of  the 
magnetic  force  on  the  sphere,  and  show  that  it  is  equal  to  (q/2M)J/\B, 
where  q  and  M  are  the  total  charge  and  mass,  respectively.  Hence 
show  that  the  axis  will  precess  around  the  direction  of  the  magnetic 
field  with  precessional  angular  velocity  equal  to  the  Larmor  frequency 
of  §5.5.  What  difference  would  it  make  if  the  charge  distribution  were 
spherically  symmetric,  but  non-uniform? 

17.  *A  wheel  of  radius  a,  with  its  mass  concentrated  on  the  rim,  is  rolling 
with  velocity  v  round  a  circle  of  radius  R  (3>  a),  maintaining  a  constant 
inclination  a  to  the  vertical.  Show  that  v  =  aoj  =  Rfl,  where  u  is 
the  angular  velocity  of  the  wheel  about  its  axis,  and  fl  (-C  u>)  is  the 
precessional  angular  velocity  of  the  axis.  Use  the  momentum  equation 
to  find  the  horizontal  and  vertical  components  of  the  force  at  the  point 
of  contact.  Then  show  from  the  angular  momentum  equation  about 
the  centre  of  mass  that  R  =  2v2 / gX&na.  Evaluate  R  for  v  =  5ms"1 
and  a  =  30°. 

18.  A  solid  rectangular  box,  of  dimensions  100  mm  x  60  mm  x  20  mm,  is  spin- 
ning freely  with  angular  velocity  240  r.p.m.  Determine  the  frequency 
of  small  oscillations  of  the  axis,  if  the  axis  of  rotation  is  (a)  the  longest, 
and  (b)  the  shortest,  axis. 

19.  *A  rigid  body  of  spheroidal  shape,  spinning  rapidly  about  its  axis  of 
symmetry,  is  placed  on  a  smooth  flat  table.  Show  by  considering  the 
moment  of  the  force  at  the  point  of  contact  that  its  axis  will  precess 
in  one  direction  if  it  is  oblate  (c  <  a  =  b,  e.g.  a  discus)  and  in  the 
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opposite  direction  if  it  is  prolate  (c  >  a  =  b,  e.g.  a  rugby  ball).  Show 
also  that  if  there  is  a  small  frictional  force,  the  axis  will  become  more 
nearly  vertical,  so  that  if  the  body  is  oblate  its  centre  of  mass  will  fall, 
but  if  it  is  prolate  it  will  rise. 

20.  *The  average  moment  exerted  by  the  Sun  on  the  Earth  is,  except  for 
sign,  identical  with  the  expression  found  in  Chapter  6,  Problem  26, 
provided  we  interpret  m  as  the  mass  of  the  Sun,  and  r  as  the  distance 
to  the  Sun.  Show  that  Q  =  —2{I^  —  Ii)  and  hence  that  the  precessional 
angular  velocity  produced  by  this  moment  is 

11  =  —  —  — ;  cos  an, 

2     h  u 

where  -co  is  the  Earth's  orbital  angular  velocity,  and  a  =  23.45°  is 
the  tilt  between  the  Earth's  axis  and  the  normal  to  the  orbital  plane 
(the  ecliptic).  Show  also  that  (I3  —  Iij/Iz  ~  e,  the  oblateness  of  the 
Earth,  and  hence  evaluate  fl.  Why  is  this  effect  less  sensitive  to  the 
distribution  of  density  within  the  Earth  than  the  complementary  one 
discussed  in  §6.5? 

21.  *The  axis  of  a  gyroscope  is  free  to  rotate  within  a  smooth  horizontal 
circle  in  colatitude  A.  Due  to  the  Coriolis  force,  there  is  a  couple  on  the 
gyroscope.  To  find  the  effect  of  this  couple,  use  the  equation  for  the 
rate  of  change  of  angular  momentum  in  a  frame  rotating  with  the  Earth 
{e.g.,  that  of  Fig.  5.7),  J+flA  J  =  G,  where  G  is  the  couple  restraining 
the  axis  from  leaving  the  horizontal  plane,  and  fl  is  the  Earth's  angular- 
velocity.  (Neglect  terms  of  order  fl2,  in  particular  the  contribution  of 
fl  to  J.)  From  the  component  along  the  axis,  show  that  the  angular 
velocity  u>  about  the  axis  is  constant;  from  the  vertical  component  show 
that  the  angle  ip  between  the  axis  and  east  obeys  the  equation 

I\(p  —  I^uQ.  sin  A  cos  1/5  =  0. 

Show  that  the  stable  position  is  with  the  axis  pointing  north.  Determine 
the  period  of  small  oscillations  about  this  direction  if  the  gyroscope  is  a 
flat  circular  disc  spinning  at  6000  r.p.m.  in  latitude  30°  N.  Explain  why 
this  system  is  sensitive  to  the  horizontal  component  of  fl,  and  describe 
the  effect  qualitatively  from  the  point  of  view  of  an  inertial  observer. 
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Lagrangian  Mechanics 


We  have  already  seen  that  the  equations  of  motion  for  a  system  of  N  parti- 
cles moving  under  conservative  forces  may  be  obtained  from  the  Lagrangian 
function  in  terms  of  any  set  of  3N  independent  co-ordinates.  (See  §8.5.) 
In  this  chapter,  we  shall  give  a  more  systematic  account  of  the  Lagrangian 
method,  and  apply  it  in  particular  to  the  case  of  rigid  bodies. 

10.1     Generalized  Co-ordinates;  Holonomic  Systems 

Let  us  consider  a  rigid  body,  composed  of  a  large  number  TV  of  particles. 
The  positions  of  all  the  particles  may  be  specified  by  3iV  co-ordinates. 
However,  these  3N  co-ordinates  cannot  all  vary  independently,  but  are 
subject  to  constraints  —  the  rigidity  conditions.  In  fact,  the  position  of 
every  particle  may  be  fixed  by  specifying  the  values  of  just  six  quantities 
—  for  instance,  the  three  co-ordinates  X,  Y,  Z  of  the  centre  of  mass  and 
the  three  Euler  angles  ip,  9,  ip  which  determine  the  orientation.  These  six 
constitute  a  set  of  generalized  co-ordinates  for  the  rigid  body. 

In  particular  problems,  these  co-ordinates  may  be  subject  to  further 
constraints,  which  may  be  of  two  kinds.  First,  we  might  for  example  fix  the 
position  of  one  point  of  the  body,  say  the  centre  of  mass.  Such  constraints 
are  represented  by  algebraic  conditions  on  the  co-ordinates  (e.g.,  X  =  Y  = 
Z  =  0),  which  may  be  used  to  eliminate  some  of  the  co-ordinates.  In  this 
particular  case,  the  three  Euler  angles  alone  suffice  to  fix  the  position  of 
every  particle. 

The  second  type  of  constraint  is  represented  by  conditions  on  the  ve- 
locities rather  than  the  co-ordinates.  For  example,  we  might  constrain  the 
centre  of  mass  to  move  with  uniform  velocity,  or  to  move  round  a  circle 
with  uniform  angular  velocity.  Then,  in  place  of  algebraic  equations,  we 
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have  differential  equations  (e.g.,  X  =  u).  In  simple  cases,  these  equations 
can  be  solved  to  find  some  of  the  co-ordinates  as  explicit  functions  of  time 
(e.g.,  X  =  Xq  +  ut).  Then  the  position  of  every  particle  will  be  determined 
by  the  values  of  the  remaining  generalized  co-ordinates  and  the  time  t. 

In  general,  we  say  that  q\,  qi, . . . ,  qn  is  a  set  of  generalized  co-ordinates 
for  a  given  system  if  the  position  of  every  particle  in  the  system  is  a  function 
of  these  variables,  and  perhaps  also  explicitly  of  time: 

ri  =  ri(q1,q2,...,q„,t).  (10.1) 

The  number  of  co-ordinates  which  can  vary  independently  is  called  the 
number  of  degrees  of  freedom  of  the  system.  If  it  is  possible  to  solve  the 
constraint  equations,  and  eliminate  some  of  the  co-ordinates,  leaving  a  set 
equal  in  number  to  the  number  of  degrees  of  freedom,  the  system  is  called 
holonomic.  If  this  elimination  introduces  explicit  functions  of  time,  the 
system  is  said  to  be  forced]  on  the  other  hand,  if  all  the  constraints  are 
purely  algebraic,  so  that  t  does  not  appear  explicitly  in  (10.1),  the  system 
is  natural. 

There  do  exist  non-holonomic  systems,  for  which  the  constraint  equa- 
tions cannot  be  solved  to  eliminate  some  of  the  co-ordinates.  Consider,  for 
example,  a  sphere  rolling  on  a  rough  plane.  Its  position  may  be  specified 
by  five  generalized  co-ordinates  —  X,  Y,  <p,  9,  ip.  (Z  is  a  constant  and  may 
be  omitted.)  However,  the  sphere  can  roll  in  only  two  directions,  and  the 
number  of  degrees  of  freedom  is  two  (three  if  we  allow  it  also  to  spin  about 
the  vertical).  The  constraint  equations  —  the  rolling  conditions  —  serve 
to  determine  the  angular  velocity  in  terms  of  the  velocity  of  the  centre  of 
mass.  Using  the  fact  that  the  instantaneous  axis  of  rotation  must  be  a 
horizontal  axis  through  the  point  of  contact,  it  is  not  hard  to  show  that 
cj  =  k  A  R/ a,  where  a  is  the  radius  of  the  sphere.  (This  follows  from  the 
fact  that  Ft  =  u>  A  ak,  together  with  k  ■  u>  =  0.)  However,  these  equations 
cannot  be  integrated  to  find  the  orientation  in  terms  of  the  position  of  the 
centre  of  mass;  for,  we  could  roll  the  sphere  round  a  circle  so  that  it  returns 
to  its  starting  point  but  with  a  different  orientation. 

For  the  present,  we  shall  assume  that  the  constraint  equations  can  be 
solved,  and  the  number  of  generalized  coordinates  taken  equal  to  the  num- 
ber of  degrees  of  freedom. 

The  distinction  between  a  natural  and  a  forced  system  can  be  expressed 
in  another  way,  which  will  be  useful  later.  Differentiating  (10.1),  we  find 
that  the  velocity  of  the  ith  particle  is  a  linear  function  of  q\, . . .  ,qn,  though 
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in  general  it  depends  in  a  more  complicated  way  on  the  co-ordinates  them- 
selves: 

Edri  .  dri 

The  last  term  arises  from  the  explicit  dependence  on  t  in  (10.1),  and  is 
absent  for  a  natural  system.  When  we  substitute  in  T  =  Yl  \mv  >  we 
obtain  a  quadratic  function  of  the  time  derivatives  qi,...,qn.  For  a  natural 
system,  it  is  a  homogeneous  quadratic  function;  but  for  a  forced  system 
there  are  also  linear  terms  and  terms  independent  of  the  velocities. 

For  example,  according  to  (8.29)  and  (9.46),  the  kinetic  energy  of  a 
symmetric  rigid  body  is 

T  =  \M(X2  +  Y2  +  Z2)  +  \l{{if>2  sin2  6  +  62)  +  ±J| (ip  +  cpcostf)2. 

If  we  impose  further  algebraic  constraints,  such  as  X  =  0,  the  corresponding 
terms  drop  out,  and  we  are  still  left  with  a  homogeneous  quadratic  function 
of  the  remaining  time  derivatives.  On  the  other  hand,  if  we  impose  a 
differential  constraint,  such  as  X  =  u  or  ip  =  uj,  we  obtain  a  function  with 
constant  or  linear  terms. 

10.2    Lagrange's  Equations 

To  obtain  a  more  general  form  of  Lagrange's  equations  than  the  one  found 
in  §3.7  —  in  particular,  one  that  is  not  restricted  to  conservative  forces  — 
let  us  return  for  the  moment  to  the  case  of  a  single  particle,  moving  under 
an  arbitrary  force  F.  We  consider  variations  of  the  integral 

rti 

1=1    Tdt,       where       T=\mr2.  (10.2) 

J  t0 

Now  let  us  make  a  small  variation  6x(t)  in  the  x  co-ordinate,  subject  to 
the  boundary  conditions  5x(to)  =  5x(ti)  =  0.  Then  clearly  6T  =  mx5x. 
Substituting  in  (10.2),  and  performing  an  integration  by  parts,  in  which 
the  integrated  term  vanishes  by  virtue  of  the  boundary  conditions,  we  find 

51  =  —  mxdxdt. 

J  t0 

Now,  the  integrand  is  —Fx5x  =  —SW,  where  5W  is  the  work  done  by  the 
force  F  in  the  displacement  Sx.  Similarly,  if  we  make  variations  of  all  three 
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coordinates,  we  find 

51  =  -  r  SWdt,  (10.3) 

Jto 

where  SW  =  F  ■  5r  is  the  work  done  by  the  force  in  the  displacement  Sr. 

The  advantage  of  (10.3)  is  that,  like  Hamilton's  principle  (see  §3.7),  it 
makes  no  explicit  reference  to  any  particular  set  of  co-ordinates.  Hence 
we  can  use  it  to  find  equations  of  motion  in  terms  of  an  arbitrary  set 
of  co-ordinates  qi,q2,qz-  Let  us  define  the  generalized  forces  Fi,F2,Fs 
corresponding  to  F  by 

SW  =  Frfqx  +  F2Sq2  +  F35q3.  (10.4) 

Then  we  may  equate  SI  as  given  by  (10.3)  to  the  general  expression  (3.37). 
Consider,  for  example,  a  variation  5q±  of  q\.  Then,  according  to  (3.37), 


'dT 

fOT\] 

n0 

dqi 

-  - 

dt 

\dqj_ 

SI 

On  the  other  hand,  by  (10.3)  and  (10.4), 

rti 


Sqx(t)  dt. 


51  =  -  [  F15q1{t)dt. 


These  two  expressions  must  be  equal  for  arbitrary  variations  subject  only  to 
the  boundary  conditions  that  Sqi(to)  =  Sqi(ti)  =  0.  Hence  the  integrands 
must  be  equal,  and  we  obtain  Lagrange 's  equations  in  the  form 

dt  \dqa  J  dqa 

For  example,  for  a  particle  moving  in  a  plane,  and  described  by  polar 
co-ordinates,  T  =  ^m(r2  +  r282).  Hence  Lagrange's  equations  are 

j(mr)  =  mr82  +  Fr, 

dt  (10.6) 
±(mr29)=Fe. 

It  is  important  to  remember  that  Fg  here  does  not  mean  the  component 
of  the  vector  F  in  the  direction  of  increasing  9.  (Compare  the  discussion 
following  (3.47).)  It  is  defined  by  (10.4),  or 


5W  =  Fr5r  +  FgSO. 
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The  work  done  in  producing  a  small  angular  displacement  69  is  equal  to  59 
multiplied  by  the  moment  of  the  force  F  about  the  origin.  Thus  Fg  here 
is  in  fact  the  moment  of  F  about  the  origin.  (Recall  that  the  generalized 
momentum  pe  =  mr  9  is  the  angular  momentum.  Thus  the  second  of 
Eqs.  (10.6)  expresses  the  fact  that  the  rate  of  change  of  angular  momentum 
is  equal  to  the  moment  of  the  force.) 

If  the  force  is  conservative,  then  5W  =  —5V,  where  V  is  the  potential 
energy.  It  then  follows  from  (10.4)  that 


dV 
dqa 


(10.7) 


Thus,  defining  the  Lagrangian  function  L  =  T  —  V,  we  can  write  (10.5)  in 
the  form  (3.44)  obtained  earlier, 

dt  (dqa)      dqa'  ^°'8'> 

Note  that  the  'generalized  force'  dL/dqa  on  the  right  hand  side  of  this 
equation  includes  not  only  the  force  Fa  determined  by  F  but  also  the  'ap- 
parent' force  dT/dqa  arising  from  the  curvilinear  nature  of  the  co-ordinates 
(for  example,  the  centrifugal  term  mr92  in  (10.6)). 

More  generally,  even  if  F  is  non-conservative,  it  may  be  possible  to  find 
an  effective  'potential  energy'  function  V  depending  on  the  velocities  as 
well  as  the  co-ordinates,  and  such  that 

dV      d  f  8V\ 

+  777  Ur-  •  (10-9) 


dqa      dt  \dqa 

In  this  case  too,  we  can  define  L  =  T  —  V,  and  write  Lagrange's  equations 
in  the  form  (10.8).  The  most  important  example  of  a  force  of  this  type 
in  the  electromagnetic  force  on  a  charged  particle,  which  we  shall  discuss 
in  §10.5.  Note  that  in  this  case  the  'potential  energy'  V  contributes  not 
only  to  the  'generalized  force'  on  the  right  side  of  (10.8)  but  also  to  the 
'generalized  momentum'  on  the  left. 

There  is  nothing  in  the  discussion  leading  to  Lagrange's  equations  (10.5) 
or  (10.8)  that  restricts  their  applicability  to  the  case  of  a  single  particle. 
The  general  principle  embodied  in  (10.3)  applies  to  any  system  whatsoever, 
containing  any  number  of  particles.  If  the  system  is  holonomic,  with  n 
degrees  of  freedom,  then  we  can  make  independent  arbitrary  variations  of 
the  generalized  co-ordinates  qi,  q2,  ■  ■  ■ ,  qn,  and  obtain  Lagrange's  equations 
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(10.5),  in  which  the  generalized  forces  are  defined,  as  in  (10.4),  by 

n 

5W  =  FiSqi  +  F25q2  +  ■■■  +  Fn5qn  =  ^2  Fa5qa.  (10.10) 

a=l 

Finally,  if  the  forces  are  conservative,  or  more  generally  if  they  can  be 
written  in  the  form  (10.9),  then  Lagrange's  equations  may  be  expressed  in 
the  form  (10.8). 

10.3    Precession  of  a  Symmetric  Top 

As  a  first  example  of  the  use  of  Lagrange's  equations,  we  consider  in  more 
detail  the  problem  of  the  symmetric  top,  discussed  in  the  previous  chapter. 
This  is  a  symmetric  rigid  body,  pivoted  at  a  point  on  its  axis  of  symmetry, 
and  moving  under  gravity.  (Compare  Fig.  9.7.) 

Example:  Steady  precession  of  a  top 

Find  the  conditions  under  which  steady  precession  at  a  given 
constant  angle  of  inclination  of  the  axis,  9,  can  occur.  Show 
that  for  any  9  <  it/2,  there  is  a  minimum  value  of  the  rotational 
angular  velocity  about  the  axis  for  which  this  is  possible. 

This  system  has  three  degrees  of  freedom,  and  we  use  the  three 
Euler  angles  as  generalized  co-ordinates.  The  kinetic  energy  is  given  by 
(9.46),  and  the  potential  energy  is  V  =  MgR  cos  9.  Thus  the  Lagrangian 
function  is 

L  =  i/1v92sin26»+  \h§2  +  \h(^  +  ipcos6)2  -  MgRcosO.  (10.11) 
Lagrange's  equation  (10.8)  for  9  is  therefore 

^-(h0)  =  hp2  sin^ cos 9  -  I3(ip  +  <pcos9)Lpsm9  +  MgRsin9.  (10.12) 

The  Lagrangian  function  does  not  involve  the  other  two  Euler  angles, 
but  only  their  time  derivatives,  so  the  corresponding  equations  express 
the  constancy  of  the  generalized  momenta  pv  and  p^: 

^-[Iiips'm2  9  +  I3(ip  +  tpcos9)  cos9]  =  0, 
d*  d       .  (10.13) 

—  [J3W»  +  0cos0)]  =0. 
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Note  that  this  last  equation  tells  us  that  the  component  of  angular  ve- 
locity u>3  about  the  axis  of  symmetry  is  constant 

^3  =  ip  +  ipcosO  =  constant.  (10.14) 

From  the  equations  (10.13),  given  that  6  remains  constant,  we  learn 
that  both  (p  and  ip  must  be  constant.  Thus  the  axis  of  the  top  precesses 
around  the  vertical  with  constant  angular  velocity  tp  =  fl,  say.  To  find 
the  relation  between  the  angular  velocities  and  uiz,  we  examine  the  6 
equation,  (10.12).  Since  the  left  side  must  vanish,  we  obtain  (for  sin#  ^ 
0,  that  is  unless  the  axis  is  vertical), 

hfl2  cos9  -  /3w30  +  MgR  =  0.  (10.15) 

The  minimum  value  of  ^3  for  which  real  roots  for  O  exist  is  given  by 

7|w2  =  AhMgRcosO. 

If  the  top  is  spinning  more  slowly  than  this,  it  will  start  to  wobble.  For 
any  larger  value  of  u>z  there  are  two  possible  values  of  O,  the  roots  of 
(10.15). 

It  is  interesting  to  examine  the  special  case  where  LO3  is  large.  Then  there 
is  one  root  for  U  which  is  much  less  than  0*3,  given  by  fi  ss  M gR/I-jUi-j,  and 
another  which  is  comparable  in  magnitude  to  UJ3,  CI  ss  /3W3//1  cos#.  The 
first  of  these  is  just  the  approximate  result  we  obtained  in  §9.6  on  the 
assumption  that  Q  was  much  smaller  than  (see  (9.29)).  The  second 
represents  a  rapid  precessional  motion  in  which  the  gravitational  force  is 
negligible.  It  corresponds  precisely  to  the  free  precessional  motion  of  a  rigid 
body,  discussed  in  §9.9. 

Note  that  if  we  allow  0  to  be  greater  than  ir/2  —  that  is,  if  we  consider 
a  top  suspended  below  its  point  of  support  —  then  steady  precession  is 
possible  for  any  value  of  W3.  In  particular,  for  cu^  =  0,  we  find  the  possible 
angular  velocities  of  a  compound  pendulum  swinging  in  a  circle, 


n  =  ± 


MgR 
ii|  cos#| 


We  shall  discuss  the  more  general  motion  of  this  system  in  Chapter  12, 
using  the  Hamiltonian  methods  developed  there. 
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10.4    Pendulum  Constrained  to  Rotate  about  an  Axis 

Next,  let  us  consider  the  system  illustrated  in  Fig.  10.1.  It  consists  of  a 
light  rigid  rod,  of  length  I,  carrying  a  mass  m  at  one  end,  and  hinged  at 


m 


Fig.  10.1 

the  other  to  a  vertical  axis,  so  that  it  can  swing  freely  in  a  vertical  plane. 
We  suppose  initially  that,  in  addition  to  gravity,  a  known  torque  G{t)  is 
applied  to  rotate  it  about  the  axis. 

This  system  has  two  degrees  of  freedom.  Its  position  may  be  described 
by  the  two  polar  angles  9,  tp.  (For  convenience,  we  take  the  z-axis  vertically 
downward,  so  that  9  =  0  is  the  equilibrium  position.) 

The  Lagrangian  function  is 

L  =  \ml2(92  +  <p2  sin2  9)  -  mgl(l  -  cos  9).  (10.16) 

Since  the  work  done  by  the  torque  G  is  SW  =  G  6<p,  Lagrange's  equations 
are 


or,  explicitly, 


d 

(dL\ 

dL 

d 

fdL 

dt 

\d9) 

dt 

[dip 

ml29 

=  ml2p2 

sin# 

cos  9 

(10.17) 


^(ml2psm29)  =  G.  (10.18) 


Now  let  us  suppose  that  the  torque  G  is  adjusted  to  constrain  the  system 
to  rotate  with  constant  angular  velocity  ui  about  the  vertical.  This  imposes 
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the  constraint  ip  =  u>,  and  the  system  may  be  regarded  as  a  system  with 
one  degree  of  freedom,  described  by  the  co-ordinate  9.  Substituting  this 
constraint  into  the  Lagrangian  function  (10.16),  we  find 

L  =  \ml2(92  +  J2  sin2  9)  -  mgl{\  -  cos  9).  (10.19) 

Note  the  appearance  in  the  kinetic  energy  part  of  L  of  a  term  indepen- 
dent of  9,  which  is  characteristic  of  a  forced  system.  It  is  often  useful  in 
problems  of  this  kind  to  separate  the  Lagrangian  function  into  a  'kinetic' 
term  T',  quadratic  in  the  time  derivatives,  and  a  'potential'  term  —V, 
independent  of  them.  In  this  case,  we  write  L  =  T'  —  V' ,  where 

T'  =  \ml292,        V  =  mgl{\  -  cos6>)  -  \ml2uj2  sin2  9. 

Physically,  this  corresponds  to  using  a  rotating  frame,  rotating  with  angular 
velocity  u>.  Here  T'  is  the  kinetic  energy  of  the  motion  relative  to  this 
frame,  and  the  extra  term  in  V  is  the  potential  energy  corresponding  to 
the  centrifugal  force. 

For  a  forced  system,  the  constraining  forces  can  do  work  on  the  system, 
so  the  total  energy  T  +  V  is  not  in  general  a  constant.  However,  in  our  case, 
there  is  still  a  conservation  law.  Multiplying  (10.17)  by  9  and  integrating, 
we  find 

T'  +  V'  =  E'  =  constant.  (10.20) 

This  is  not  the  total  energy  T  +  V,  because  the  centrifugal  term  appears 
with  the  opposite  sign. 

It  is  easy  to  verify  the  consistency  of  this  by  calculating  the  work  done 
on  the  system.  The  rate  at  which  the  torque  does  work  is  Geo.  Thus,  from 
(10.18), 

A(T  +  v)  =  Glj  =  ^-(ml2Lj2  sin2  6). 

Since  T  +  V  =  T  +  V  -  ml2  J2  sin2  9,  this  is  equivalent  to  (10.20). 

We  can  use  this  conservation  law,  just  as  we  did  in  §2.1,  to  find  the 
qualitative  features  of  the  motion. 

Example:  Types  of  motion  of  the  constrained  pendulum 

Describe  the  possible  types  of  motion  of  the  constrained  pendu- 
lum for  a  given  value  of  u>. 
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To  do  this,  we  draw  the  'effective  potential  energy'  diagram  for  the 
function  V.  There  are  two  distinct  cases,  depending  on  the  value  of 
u.  If  ui2  <  g/l  (that  is,  if  the  rotation  period  is  longer  than  the  free 
oscillation  period  of  the  pendulum),  then  V'  has  a  minimum  at  9  =  0, 
and  a  maximum  at  6  =  n,  as  shown  in  the  upper  curve  in  Fig.  10.2.  The 
motion  is  qualitatively  just  like  that  of  the  ordinary  pendulum,  though 
the  period  is  longer. 


Fig.  10.2 

If  ui2  >  g/l,  then  V'  has  maxima  at  both  0  and  n,  and  an  interme- 
diate minimum  at  9  =  9()  =  wccos{g /lui2),  as  shown  in  the  lower  curve 
in  Fig.  10.2.  The  equilibrium  position  at  9  =  0  is  then  unstable.  The 
stable  equilibrium  position  at  9$  is  the  one  where  the  transverse  compo- 
nents of  the  gravitational  and  centrifugal  forces  are  in  equilibrium.  For 
this  case,  three  types  of  motion  are  possible.  If  E'  <  0,  the  pendulum 
oscillates  around  this  stable  position,  without  ever  reaching  the  verti- 
cal. For  0  <  E'  <  2mgl,  it  swings  from  one  side  to  the  other  as  before, 
though  9  =  0  is  no  longer  the  position  of  maximum  velocity.  Finally,  if 
E'  >  2mgl,  we  have  a  continuous  circular  motion,  passing  through  both 
the  upward  and  downward  verticals. 

There  is  a  more  general  way  of  handling  constraints  (holonomic  or  not), 
using  the  method  of  Lagrange  multipliers  (see  Appendix  A,  Problem  10). 
To  impose  the  constraint  (p  =  ui  on  our  system,  we  introduce  a  new  variable 
A,  the  Lagrange  multiplier,  and  subtract  A  times  the  constraint  from  the 
Lagrangian  function.  From  (10.16),  we  obtain  in  this  way 


L  =  \ml2(92  +  <p2  sin2  9)  -  mgl(l  -  cos (9)  -  \(<p  -  uj).  (10.21) 
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The  Euler-Lagrange  equation  for  6,  (10.17)  is  unchanged.  That  for  ip  be- 
comes 

^(ml2<psm26  -  A)  =  0.  (10.22) 

In  addition,  we  have  the  Euler-Lagrange  equation  for  A,  which  simply  re- 
produces the  constraint, 

0  =  tp  —  ui. 

Comparing  (10.22)  with  (10.18),  we  see  that  A  is  physically  the  torque  G 
required  to  impose  the  constraint.  Equivalently,  we  could  have  integrated 
the  constraint,  and  subtracted  \(ip  —  cot)  from  L.  In  that  case,  A  itself 
would  be  the  torque.  In  general,  the  physical  significance  of  A  may  be 
found  by  considering  a  small  virtual  change  in  the  constraint  equation. 
This  method  is  particularly  useful  when  we  want  to  find  the  constraining 
forces  or  torques. 


10.5    Charged  Particle  in  an  Electromagnetic  Field 

We  now  wish  to  consider  one  of  the  most  important  examples  of  a  non- 
conservative  force.  We  consider  a  particle  of  charge  q  moving  in  an  electric 
field  E  and  a  magnetic  field  B.  The  force  on  the  particle  is  then 

F  =  q(E  +  vAB),  (10.23) 

or,  in  terms  of  components, 

Fx  =  qEx  +  q(yBz  -  zBy),  (10.24) 

together  with  two  similar  equations  obtained  by  cyclic  permutation  of 
x,y,z. 

Now,  we  wish  to  show  that  this  force  may  be  written  in  the  form  (10.9) 
with  a  suitably  chosen  function  V.  To  do  this,  we  have  to  make  use  of  a 
standard  result  of  electromagnetic  theory  (see  §A.7),  according  to  which 
it  is  always  possible  to  find  a  scalar  potential  <p  and  a  vector  potential  A, 
functions  of  r  and  t,  such  that 

BA 

E  =  — — ,       B  =  VAA.  (10.25) 

For  time- independent  fields,  4>  is  simply  the  electrostatic  potential  of  Chap- 
ter 6. 
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Now,  let  us  consider  the  function 


V  =  q<f>(r,  t)  -  qr  ■  A(r,  t) 
=  q<j>-  q(xAx  +  yAy  +  zAz). 


(10.26) 


Clearly, 


dV         do  ,     ( .  dAx  t  .  dAy  ,  .  OA 

Also, 


dx        ^  dx     ^\dx         dx  +  Z  dx 


d  fdV\  dAT 


dt  V  dx  )        q  dt  ' 

so,  bearing  in  mind  that  Ax  varies  with  time  both  because  of  its  explicit 
time  dependence,  and  because  of  its  dependence  on  the  particle  position 
r(t),  we  find 

d  fdV\  (dAx     dAx  .     dAx  .     dAx  . 

di{-di)=-q{-dr  +  ^x-X+^y-y+^z-Z 

Adding,  we  find  that  the  terms  in  x  cancel,  whence 

dV      d  fdV\        (   86  dA 


+  37  hi  =9 


dx     dt  V  dx  J        \    dx  dt 


dAy     dAx\      .  I'd  A,  dAx 
1  +  z 


dx       dy  J       \  dx  dz 
=  qEx  +  q(yBz  -  zBy), 

by  (10.25).  But  this  is  just  the  expression  for  Fx  given  by  (10.24).  Hence 
we  have  verified  that  F  is  given  by  (10.9)  with  V  equal  to  (10.26). 

It  follows  that  the  equations  of  motion  for  a  particle  in  an  electromag- 
netic field  may  be  obtained  from  the  Lagrangian  function 

L  =  \mr2  +  qr  ■  A(r,  t)  -  q6(r,  t).  (10.27) 

Note  the  appearance  in  L  of  terms  linear  in  the  time  derivatives.  This 
function  cannot  be  separated  into  two  parts,  T'  —  V,  one  quadratic  in 
r  and  one  independent  of  it.  Another  consequence  of  the  appearance  of 
these  terms  is  that  the  generalized  momentum  px  is  no  longer  equal  to  the 
familiar  mechanical  momentum,  mx.  Instead, 

Px  =  -qT  =  mx  +  qAx, 
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or,  more  generally, 

p  =  mr  +  qA.  (10.28) 

We  can  now  obtain  the  equations  of  motion  in  terms  of  arbitrary  co- 
ordinates from  the  Lagrangian  function  (10.27).  For  example,  in  terms  of 
cylindrical  polars,  it  reads 

L  =  \m{p2  +  p2ip2  +  z2)  +  q(pAp  +  ppAv  +  zAz)  -  q<j>.  (10.29) 


Example:  Uniform  magnetic  field 

A  charged  particle  moves  in  a  uniform  static  magnetic  field  B. 
Find  the  solutions  of  the  equations  of  motion  in  which  p  is  con- 
stant. 

For  a  uniform  magnetic  field,  we  may  take 

4>  =  0       and       A  =  ^B  A  r, 

or,  if  we  choose  the  z-axis  in  the  direction  of  B, 

0  =  0,       Ap  =  0,       Av  =  \Bp,       Az  =  0. 

(It  is  easy  to  verify  the  relation  B  =  VAA  using  (A. 55).)  Thus  the 
Lagrangian  function  is 

L  =  \m(p2  +  p2Lp2  +  z2)  +  ^qBp2ip.  (10.30) 

Lagrange's  equations  are  therefore 

rap  =  rnpip2  +  qBpip, 

^-t(mp2ip+yBp2)  =  0,  (10.31) 

mz  =  0. 

In  particular,  in  the  case  where  p  is  a  constant,  we  learn  from  the 
last  two  equations  that  tp  and  z  are  also  constants,  and  from  the  first 
equation  that  either  ip  =  0  (particle  moving  parallel  to  the  z-axis),  or 

.  =_q_B 
^        m  ' 

This  is  of  course  precisely  the  solution  we  obtained  in  §5.2. 
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The  second  of  the  three  equations  (10.31)  is  particularly  interesting.  It 
shows  that,  although  in  general  the  z  component  of  the  particle  angular 
momentum  is  not  a  constant,  there  is  still  a  corresponding  conservation 
law  for  the  quantity 

pv  =  mp2ip  +  \qBp2 . 

The  reason  for  the  existence  of  such  a  conservation  law  will  be  discussed  in 
Chapter  12. 


10.6    The  Stretched  String 

As  a  final  example  of  the  use  of  the  Lagrangian  method,  we  consider  a 
rather  different  kind  of  problem.  This  is  an  example  of  a  system  with  an 
infinite  number  of  degrees  of  freedom  —  a  string  of  length  I,  and  mass  p 
per  unit  length,  with  fixed  ends,  and  stretched  to  a  tension  F. 

We  shall  consider  small  transverse  oscillations  of  the  string.  In  place  of 
a  finite  set  of  generalized  co-ordinates,  we  now  have  a  continuous  function, 
the  displacement  y(x,  t)  of  the  string  from  its  equilibrium  position.  For  the 
partial  derivatives  of  y  we  use  the  notations 

dy  ,  dy 

V=W  V=dx- 

The  kinetic  energy  of  a  small  element  of  string  of  length  da:  is  ^(pdx)y2. 
Tims  the  total  kinetic  energy  is 


t=  f  yy2dx. 

Jo 


When  the  string  is  in  equilibrium,  its  length  is  I.  However,  when  it  is 
displaced,  its  length  is  increased,  say  to  I  +  Al,  and  is  given  by  (3.34), 
namely 


l  +  Al=[  ^/l+y'2  Ax. 
Jo 

The  work  done  against  the  tension  in  increasing  the  length  by  Al  is  F  Al. 
This  is  the  potential  energy  of  the  string.  For  small  displacements,  we  can 
replace  yjl  +  y'2  by  1  +  \y'2 ■  Thus  we  may  take 

V  =  [  \Fy'2dx.  (10.32) 
Jo 
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Our  Lagrangian  function  is,  therefore, 


Jo 


/'-.'/ 


•2 


(10.33) 


As  before,  we  can  use  Hamilton's  principle  to  derive  the  equation  of 
motion.  To  do  this,  it  will  be  useful  to  consider  a  more  general  problem,  in 
which  the  Lagrangian  function  has  the  form 


L  =  [  £(y,y>y')<te, 

Jo 


(10.34) 


where  C  may  be  called  the  Lagrangian  density.    (In  our  particular  case, 
C,  given  by  the  integrand  of  (10.33),  is  actually  independent  of  y,  and  a 
function  only  of  its  derivatives.) 
The  action  integral  is 


I=(     [  C(y,y,y')dxdt. 

J  t0  J0 


Let  us  consider  a  small  variation  of  the  displacement  5y(x,  t)  which  vanishes 
at  to  and  t\ ,  and  also  (because  the  ends  of  the  string  are  fixed)  at  x  =  0 
and  x  =  I.  The  variation  of  the  action  integral  is 


61 


dC5  |  dCd(8y)  +  dC  d{8y) 
dy         dy    dt       dy1  dx 


dx  dt. 


Now,  as  in  §3.6,  we  integrate  by  parts,  with  respect  to  t  in  the  second  term, 
and  with  respect  to  x  in  the  third.  In  both  cases,  the  integrated  terms 
vanish  because  8y  is  zero  at  the  limits  of  integration.  Thus  we  obtain 


51 


dC 

dy 


d_ 

dt 


dC 

dy 


d_ 

dx 


dC_ 

dy1 


5y{x,  t)  dx  dt. 


Hamilton's  principle  requires  that  this  expression  should  be  zero  for  arbi- 
trary variations  Sy(x,  t)  vanishing  at  the  limits.  This  is  possible  only  if  the 
integrand  vanishes  identically.  Hence  we  obtain  Lagrange's  equation 


dC 

I 

fdc\ 

-± 

dy 

dt 

\dy) 

dx 

[dy') 

0. 


(10.35) 


In  the  particular  case  of  the  stretched  string,  the  derivatives  of  the 
Lagrangian  density  are 
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Thus  Lagrange's  equation  becomes 

y  =  c2y",        where       c2  =  F/fi.  (10.36) 

In  place  of  a  discrete  set  of  ordinary  differential  equations  for  the  gener- 
alized co-ordinates  qa(t),  we  now  have  a  partial  differential  equation  for 
the  function  y(x,t)  of  two  independent  variables.  Note  that  c2  has  the 
dimensions  of  (velocity)2.  In  fact,  as  we  show  below,  c  is  the  velocity  of 
propagation  of  waves  along  the  string. 

Equation  (10.36)  is  the  one-dimensional  wave  equation.  Similar  equa- 
tions turn  up  in  many  branches  of  physics,  whenever  wave  phenomena  are 
encountered. 

Example:  Solution  of  the  stretched-string  equation 

Given  the  initial  displacement  y(x,  0)  and  velocity  y(x,  0)  of  the 
string  find  its  displacement  at  a  later  time  t. 

Ignoring  for  the  moment  the  boundary  conditions  at  x  =  0  and  x  =  I, 
the  general  solution  of  the  wave  equation  (10.36)  involves  two  arbitrary 
functions,  which  may  be  determined  by  the  initial  values  of  y  and  y.  It 
is  easy  to  verify  by  direct  substitution  that,  for  any  function  /, 

y{x,t)  =  f(x  -  ct) 

is  a  solution.  It  represents  a  wave  travelling  along  the  string  with  velocity 
c  to  the  right;  for,  the  shape  of  the  displacement  y  is  the  same  at  time 
t  as  at  time  0,  but  shifted  to  the  right  by  a  distance  ct  (see  Fig.  10.3). 
Similarly 

y{x,t)  =  f(x  +  ct) 

is  a  solution,  representing  a  wave  travelling  to  the  left  along  the  string. 
The  general  solution  is 

y  =  f(x  +  ct)  +g(x  -  ct), 

where  /  and  g  are  arbitrary  functions. 

To  satisfy  the  boundary  conditions  at  x  =  0  and  x  =  I  both  terms  in 
the  solution  must  be  present.  For  a  string  of  finite  length,  a  wave  cannot 
travel  indefinitely  in  one  direction.  When  it  gets  to  the  end  of  the  string 
it  must  be  reflected  back. 
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Fig.  10.3 


The  condition  y  =  0  at  x  =  0  requires  that 

f(ct)+g(-ct)  =  0 

for  all  t.  Thus  the  functions  /  and  g  differ  only  in  sign,  and  in  the  sign 
of  the  argument,  and  we  can  write  the  solution  as 

y  =  f(x  +  ct)-f(ct-x).  (10.37) 

The  condition  y  =  0  at  x  =  I  requires  that 

f(ct  +  1)-  f(ct  -0=0 

for  all  t.  In  other  words,  /  must  be  a  periodic  function,  of  period  21, 
satisfying 

f(x  +  2l)  =  f(x).  (10.38) 

To  establish  the  connection  between  the  periodic  function  /  and  the 
initial  conditions,  let  us  write  /  as  the  sum  of  an  even  function  and  an 
odd  function: 

f(x)  =  \[h(x)+k{x)},  (10.39) 

where 

h(x)=f(x)  +  f(-x), 
k(x)  =  f(x)  -  f(-x). 

To  fix  /,  we  have  to  specify  its  values  in  the  interval  from  —I  to  I,  or, 
equivalently,  to  specify  both  functions  h  and  k  in  the  interval  from  0  to 
I.  Now  from  (10.37),  the  initial  value  of  y  is 

y(x,0)  =  f(x)  -  f(-x)  =  k(x).  (10.40) 
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This  therefore  determines  the  odd  part  of  /.  Moreover,  the  initial  value 
of  y  is 

y(x,  0)  =  cf'(x)  -  cf'(-x)  =  ch'(x).  (10.41) 

(Remember  that  [f(—x)]'  =  —f'(—x).)  Thus  the  initial  value  of  y  deter- 
mines the  even  part  of  /  (up  to  an  irrelevant  additive  constant,  which 
cancels  in  (10.37)). 

10.7  Summary 

The  position  of  every  part  of  a  system  may  be  fixed  by  specifying  the  values 
of  a  set  of  generalized  co-ordinates.  If  these  co-ordinates  can  all  vary  inde- 
pendently, the  system  is  holonomic.  This  is  the  case  in  all  the  examples  we 
have  considered.  The  system  is  natural  if  the  functions  specifying  the  posi- 
tions of  particles  in  terms  of  the  generalized  co-ordinates  do  not  involve  the 
time  explicitly.  In  that  case,  the  kinetic  energy  is  a  homogeneous  quadratic 
function  of  the  qa.  For  a  forced  system,  on  the  other  hand,  T  may  contain 
linear  and  constant  terms.  In  either  case,  the  equations  of  motion  are  given 
by  Lagrange's  equations.  If  the  forces  are  conservative  (and  sometimes  in 
other  cases  too),  all  we  need  is  the  Lagrangian  function  L  =  T  —  V.  In 
general,  for  dissipative  forces,  the  generalized  forces  Fa  corresponding  to 
the  generalized  co-ordinates  qa  must  be  found  by  evaluating  the  work  done 
in  a  small  displacement. 


Problems 

1.  Masses  m  and  2m  are  joined  by  a  light  inextensible  string  which  runs 
without  slipping  over  a  uniform  circular  pulley  of  mass  2m  and  radius 
a.  Using  the  angular  position  of  the  pulley  as  generalized  co-ordinate, 
write  down  the  Lagrangian  function,  and  Lagrange's  equation.  Find 
the  acceleration  of  the  masses. 

2.  A  uniform  cylindrical  drum  of  mass  M  and  radius  a  is  free  to  rotate 
about  its  axis,  which  is  horizontal.  A  cable  of  negligible  mass  and  length 
I  is  wound  on  the  drum,  and  carries  on  its  free  end  a  mass  m.  Write 
down  the  Lagrangian  function  in  terms  of  an  appropriate  generalized 
co-ordinate,  assuming  no  slipping  or  stretching  of  the  cable.  If  the  cable 
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is  initially  fully  wound  up,  and  the  system  is  released  from  rest,  find 
the  angular  velocity  of  the  drum  when  it  is  fully  unwound. 

3.  Treat  the  system  of  Problem  2  as  one  with  two  generalized  co-ordinates, 
the  angular  position  of  the  drum  and  the  free  length  of  cable,  with  an 
appropriate  constraint.  Hence  find  the  tension  in  the  cable.  (Show  that 
it  is  equal  to  the  Lagrange  multiplier.) 

4.  *Find  the  Lagrangian  function  for  the  system  of  Problem  2  if  the  cable 
is  elastic,  with  elastic  potential  energy  ^kx2,  where  x  is  the  extension  of 
the  cable.  Show  that  the  motion  of  the  mass  m  is  a  uniform  acceleration 
at  the  same  rate  as  before,  with  a  superimposed  oscillation  of  angular- 
frequency  given  by  u>2  =  k(M  +  2m) /Mm.  Find  the  amplitude  of  this 
oscillation  if  the  system  is  released  from  rest  with  the  cable  unextended. 

5.  Write  down  the  kinetic  energy  of  a  particle  in  cylindrical  polar  co- 
ordinates in  a  frame  rotating  with  angular  velocity  u>  about  the  0-axis. 
Show  that  the  terms  proportional  to  ui  and  ut2  reproduce  the  Coriolis 
force  and  centrifugal  force  respectively. 

6.  A  light  inextensible  string  passes  over  a  light  smooth  pulley,  and  carries 
a  mass  4m  on  one  end.  The  other  end  supports  a  second  pulley  with 
a  string  over  it  carrying  masses  3m  and  m  on  the  two  ends.  Using 
a  suitable  pair  of  generalized  co-ordinates,  write  down  the  Lagrangian 
function  for  the  system,  and  Lagrange's  equations.  Find  the  downward 
accelerations  of  the  three  masses. 

7.  Find  the  tensions  in  the  strings  in  Problem  6.  Explain  why  the  first 
pulley  turns,  although  the  total  mass  on  each  side  is  the  same. 

8.  Evaluate  accurately  the  two  possible  precessional  angular  velocities  of 
the  top  described  in  Chapter  9,  Problem  13,  if  the  axis  makes  an  angle  of 
30°  with  the  vertical.  Compare  the  slower  value  with  the  approximate 
result  found  earlier.  Find  also  the  minimum  angular  velocity  ui^  for 
which  steady  precession  at  this  angle  is  possible. 

9.  A  simple  pendulum  of  mass  m  and  length  I  hangs  from  a  trolley  of 
mass  M  running  on  smooth  horizontal  rails.  The  pendulum  swings  in 
a  plane  parallel  to  the  rails.  Using  the  position  x  of  the  trolley  and  the 
angle  of  inclination  9  of  the  pendulum  as  generalized  co-ordinates,  write 
down  the  Lagrangian  function,  and  Lagrange's  equations.  Obtain  an 
equation  of  motion  for  0  alone.  If  the  system  is  released  from  rest  with 
the  pendulum  inclined  at  30°  to  the  vertical,  use  energy  conservation 
to  find  its  angular  velocity  when  it  reaches  the  vertical,  given  that 
M  =  2kg,  m  =  1kg,  and  I  =  2m. 
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10.  *Show  that  the  kinetic  energy  of  the  gyroscope  described  in  Chapter  9, 
Problem  21,  is 

T  =  i/i(ftsinAcos</?)2  +  \h{ip  +  ficosA)2  +  ^I3(ip  +  SlsinAsint/?)2. 

From  Lagrange's  equations,  show  that  the  angular  velocity  ujz  about 
the  axis  is  constant,  and  obtain  the  equation  for  ip  without  neglecting 
Q2.  Show  that  motion  with  the  axis  pointing  north  becomes  unstable 
for  very  small  values  of  L03,  and  find  the  smallest  value  for  which  it 
is  stable.  What  are  the  stable  positions  when  UJ3  =  0?  Interpret  this 
result  in  terms  of  a  non-rotating  frame. 

11.  *Find  the  Lagrangian  function  for  a  symmetric  top  whose  pivot  is  free 
to  slide  on  a  smooth  horizontal  table,  in  terms  of  the  generalized  co- 
ordinates X,  Y,  ip,  6,  ip,  and  the  principal  moments  1%  about  the 
centre  of  mass.  (Note  that  Z  is  related  to  6.)  Show  that  the  horizontal 
motion  of  the  centre  of  mass  may  be  completely  separated  from  the 
rotational  motion.  What  difference  is  there  in  the  equation  (10.15)  for 
steady  precession?  Are  the  precessional  angular  velocities  greater  or 
less  than  in  the  case  of  a  fixed  pivot?  Show  that  steady  precession  at 
a  given  value  of  6  can  occur  for  a  smaller  value  of  uiz  than  in  the  case 
of  a  fixed  pivot. 

12.  *A  uniform  plank  of  length  2a  is  placed  with  one  end  on  a  smooth 
horizontal  floor  and  the  other  against  a  smooth  vertical  wall.  Write 
down  the  Lagrangian  function,  using  two  generalized  co-ordinates,  the 
distance  x  of  the  foot  of  the  plank  from  the  wall,  and  its  angle  6  of 
inclination  to  the  horizontal,  with  a  suitable  constraint  between  the 
two.  Given  that  the  plank  is  initially  at  rest  at  an  inclination  of  60°, 
find  the  angle  at  which  it  loses  contact  with  the  wall.  (Hint:  First  write 
the  co-ordinates  of  the  centre  of  mass  in  terms  of  x  and  6.  Note  that 
the  reaction  at  the  wall  is  related  to  the  Lagrange  multiplier.) 

13.  Use  Hamilton's  principle  to  show  that  if  F  is  any  function  of  the  general- 
ized co-ordinates,  then  the  Lagrangian  functions  L  and  L  +  dF/dt  must 
yield  the  same  equations  of  motion.  Hence  show  that  the  equations  of 
motion  of  a  charged  particle  in  an  electromagnetic  field  are  unaffected 
by  the  'gauge  transformation'  (A. 42).  (Hint:  Take  F  =  —qA.) 

14.  The  stretched  string  of  §10.6  is  released  from  rest  with  its  mid-point 
displaced  a  distance  a,  and  each  half  of  the  string  straight.  Find  the 
function  f(x).  Describe  the  shape  of  the  string  after  (a)  a  short  time, 
(b)  a  time  I /2c,  and  (c)  a  time  l/c. 
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15.  *Two  bodies  of  masses  Mi  and  Mi  are  moving  in  circular  orbits  of 
radii  oi  and  02  about  their  centre  of  mass.  The  restricted  three-body 
problem  concerns  the  motion  of  a  third  small  body  of  mass  m  (-C  M\ 
or  M2)  in  their  gravitational  field  (e.g.,  a  spacecraft  in  the  vicinity  of 
the  Earth-Moon  system).  Assuming  that  the  third  body  is  moving  in 
the  plane  of  the  first  two,  write  down  the  Lagrangian  function  of  the 
system,  using  a  rotating  frame  in  which  M\  and  M2  are  fixed.  Find 
the  equations  of  motion.  (Hint:  The  identities  GM\  =  u>2a2a2  and 
GM2  =  <jJ2a2a\  may  be  useful,  with  a  =  a\  +  a2  and  uj2  =  GM/a3.) 

16.  *For  the  system  of  Problem  15,  find  the  equations  that  must  be  satisfied 
for  'equilibrium'  in  the  rotating  frame  (i.e.,  circular  motion  with  the 
same  angular  velocity  as  Mi  and  M2).  Consider  'equilibrium'  positions 
on  the  line  of  centres  of  Mi  and  M2.  By  roughly  sketching  the  effec- 
tive potential  energy  curve,  show  that  there  are  three  such  positions, 
but  that  all  three  are  unstable.  (Note:  The  positions  are  actually  the 
solutions  of  a  quintic  equation.)  Show  also  that  there  are  two  'equi- 
librium' positions  off  the  line  of  centres,  in  each  of  which  the  three 
bodies  form  an  equilateral  triangle.  (The  stability  of  these  so-called 
Lagrangian  points  is  the  subject  of  Problem  12,  Chapter  12.  There  is 
further  consideration  of  this  important  problem  in  §14.4.) 
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Chapter  11 

Small  Oscillations  and  Normal  Modes 


In  this  chapter,  we  shall  discuss  a  generalization  of  the  harmonic  oscillator 
problem  treated  in  §2.2  —  the  oscillations  of  a  system  of  several  degrees 
of  freedom  near  a  position  of  equilibrium.  We  consider  only  conservative, 
holonomic  systems,  described  by  n  generalized  co-ordinates  qi,Q2, .  ..,q„. 
Without  loss  of  generality,  we  may  choose  the  position  of  equilibrium  to  be 
?i  =  <?2  =  •  •  •  =  Qn  =  0.  We  shall  begin  by  investigating  the  form  of  the 
kinetic  and  potential  energy  functions  near  this  point. 

11.1     Orthogonal  Co-ordinates 

We  shall  restrict  our  attention  to  natural  systems,  for  which  the  kinetic 
energy  is  a  homogeneous  quadratic  function  of  q\,  <?2,  •  •  • ,  qn-  (More  gener- 
ally, we  could  include  also  those  forced  systems  —  like  that  of  §10.4  —  for 
which  L  can  be  written  as  a  sum  of  a  quadratic  term  T'  and  a  term  —  V 
independent  of  the  time  derivatives.  The  only  essential  restriction  is  that 
there  should  be  no  linear  terms.) 
For  example,  for  n  =  2,  we  have 

T  =  §Clll<Zi  +  a12<h?2  + 

In  general,  the  coefficients  an,  a\2,  an  will  be  functions  of  q\  and  q^.  How- 
ever, if  we  are  interested  only  in  small  values  of  gi  and  qi,  we  may  neglect 
this  dependence,  and  treat  them  as  constants  (equal  to  their  values  at 
q\  =  qi  =  0). 

For  a  particle  described  by  curvilinear  co-ordinates,  the  co-ordinates  are 
called  orthogonal  if  the  co-ordinate  curves  always  intersect  at  right  angles 
(see  §3.5).  In  that  case,  the  kinetic  energy  contains  terms  in  gi,<jf,g§, 
but  no  cross  products  like  q\qi-  By  an  extension  of  this  terminology,  the 
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generalized  co-ordinates  qi,  q2,  ■  ■ . ,  qn  are  called  orthogonal  if  T  is  a  sum  of 
squares,  with  no  cross  products  —  for  example,  if  a  12  above  is  zero. 

It  is  a  considerable  simplification  to  choose  the  co-ordinates  to  be  or- 
thogonal, and  this  can  always  be  done.  For  instance,  we  may  set 


Qi  =  Qi  +  Q2, 

an 


so  that,  in  terms  of  q[  and  52,  (H-l)  becomes 


T  =  \anq[2  +  \o!22q\,        with       a'22  =  a22  — .  (11.2) 

an 

We  can  even  go  further.  Since  T  is  necessarily  positive  for  all  possible  q[, 
q2,  the  coefficients  in  (11.2)  must  be  positive  numbers.  Hence  we  can  define 
new  co-ordinates 


so  that  T  is  reduced  to  a  simple  sum  of  squares: 

T=\&2  +  \&2.  (11.3) 

A  similar  procedure  can  be  used  in  the  general  case.  (It  is  called  the 
Gram-Schmidt  orthogonalization  procedure.)  We  may  first  eliminate  the 
cross  products  involving  <ji  by  means  of  the  transformation  to 

,  ,    <ll2  .  aln 

Qi  =  9i  H  92  H  1  qn, 

an  an 

then  those  involving  q2,  and  so  on.  Thus  we  can  always  reduce  T  to  the 
standard  form 


where  we  now  drop  any  primes. 

As  an  illustration  of  these  ideas,  let  us  consider  the  double  pendulum 
illustrated  in  Fig.  11.1. 
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Example:  Kinetic  energy  of  the  double  pendulum 


m 


Fig.  11.1 

A  double  pendulum  consists  of  a  simple  pendulum  of  mass  M 
and  length  L,  with  a  second  simple  pendulum  of  mass  m  and 
length  I  suspended  from  it.  Consider  only  motion  in  a  vertical 
plane,  so  that  the  system  has  two  degrees  of  freedom.  Find  a 
standard  set  of  co-ordinates  in  terms  of  which  the  kinetic  energy 
takes  the  form  (11.4). 

As  generalized  co-ordinates,  we  may  initially  choose  the  inclinations 
9  and  tp  to  the  downward  vertical.  The  velocity  of  the  upper  pendulum 
bob  is  L9.  That  of  the  lower  bob  has  two  components  —  the  velocity  L9 
of  its  point  of  support,  and  the  velocity  Itp  relative  to  that  point.  The 
angle  between  these  is  tp  —  9.  Hence  the  kinetic  energy  is 

T  =  \ML292  +  \m[L292  +  I2 ip2  +  2LWtp  cos(tp  -  B)\. 

For  small  values  of  9  and  tp,  we  may  approximate  cos(<£  —  9)  by  1.  Since 
there  is  a  term  in  9(f,  these  co-ordinates  are  not  orthogonal,  but  we  can 
make  them  so  by  adding  an  appropriate  multiple  of  tp  to  9,  or,  equally 
well,  of  9  to  tp.  In  fact,  it  is  easy  to  see  physically  that  a  pair  of  orthogonal 
co-ordinates  is  provided  by  the  displacements  of  the  two  bobs,  which  for 
small  angles  are 

x  =  L9,       y  =  L9  +  l<p.  (11.5) 
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In  terms  of  these,  T  becomes 

T=\Mx2  +  \mij2.  (11.6) 

To  complete  the  reduction  to  the  standard  form  (11.4),  we  may  define 
the  new  co-ordinates 


qi  =  vMx,       q2  =  \rmy.  (H-7) 

In  practice,  this  is  not  always  essential,  though  it  makes  the  discussion 
of  the  general  case  much  easier. 


11.2    Equations  of  Motion  for  Small  Oscillations 

Now  let  us  consider  the  potential  energy  function  V.  With  T  given  by 
(11.4),  the  equations  of  motion  are  simply 

dV 

qa  =  -- — ,       for       a  =  1,2,  ...,n.  (11.8) 

dqa 

Thus  the  condition  for  equilibrium  is  that  all  n  partial  derivatives  of  V 
should  vanish  at  the  equilibrium  position. 

For  small  values  of  the  co-ordinates,  we  can  expand  V  as  a  series,  just 
as  we  did  for  a  single  co-ordinate  in  §2.2.  For  example,  for  n  =  2, 

V  =  VQ  +  (bm  +  b2q2)  +  (5*119?  +  ^129192  +  \k22ql)  +  ■■■  ■ 

The  equilibrium  conditions  require  that  the  linear  terms  should  be  zero, 
bi  =  b2  =  0,  just  as  in  §2.2.  Moreover,  the  constant  term  Vq  is  arbitrary, 
and  may  be  set  equal  to  zero  without  changing  the  equations  of  motion. 
Thus  the  leading  terms  are  the  quadratic  ones,  and  for  small  values  of  q\ 
and  92,  we  may  approximate  V  by 

V  =  ifcug?  +k12qiq2  +  \k22ql.  (11.9) 

Then  the  equations  of  motion  (11.8)  become 

q\  =  -hm  -  k12q2,  (1110) 
<?2  =  -&2191  -  k22q2, 


where  for  the  sake  of  symmetry  we  have  written  fei  =  fci2  in  the  second 
equation. 
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In  the  general  case,  V  may  be  taken  to  be  a  homogeneous  quadratic 
function  of  the  co-ordinates,  which  can  be  written 


a=10=l 


(11.11) 


with  k(ja  =  ka0.  (Notice  that  each  term  with  a  ^  [3  appears  twice,  for 
example  \k\2q\q2  and  \k2\q2q\,  which  are  of  course  equal.)  Then  the 
equations  of  motion  are 


qa  =  -  Y  kapqp,        for       a  =  1,  2, . . . ,  n, 
0=1 


or,  in  matrix  notation, 


~qi 

h 

An. 

fell   fcl2  ...  kin 

k2i  k22  ...  k2„ 
knl  kn2  •  •  •  kn 


~qi 

Sin. 

(11.12) 


(11.13) 


Example:  Equations  of  motion  of  the  double  pendulum 


Find  the  equations  of  motion  of  the  double  pendulum,  in  terms 
of  the  orthogonal  co-ordinates  x  and  y. 

The  potential  energy  of  the  double  pendulum  is  easily  seen  to  be 

V  =  (M  +  m)gL(l  -  cos  9)  +  rngl(l  -  cos  ip). 

For  small  angles,  we  may  approximate  (1  —  cos  9)  by  \92 .  Hence  in  terms 
of  x  and  y,  we  have 

V  =  i(M  +  m)gL92  +  \mgl<p 

Thus  the  equations  of  motion  are 

(M  +  m)g  mg 

L  T 

rng 


,     (M  +  m)9x2  +  ^{y_x)2 


2L 


Mx~ 

my 

mg 

~T 
_mg 

I 


.1: 

y 

(11.14) 


Note  the  appearance  of  the  masses  on  the  left  hand  side,  because  we 
have  not  gone  over  to  the  normalized  variables  of  (11.7). 
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11.3    Normal  Modes 

The  general  solution  of  a  pair  of  second-order  differential  equations  like 
(11.14)  must  involve  four  arbitrary  constants,  which  may  be  fixed  by  the 
initial  values  of  qi,  <?2,  qi,  Q2-  Similarly,  the  general  solution  of  (11.13)  must 
involve  2n  arbitrary  constants.  To  find  the  general  solution,  we  adopt  a 
generalization  of  the  method  used  for  the  damped  harmonic  oscillator  in 
§2.5:  we  look  first  for  solutions  in  which  all  the  co-ordinates  are  oscillating 
with  the  same  frequency  uj,  of  the  form 

qa  =  Aaeiu,t,  (11.15) 

where  the  Aa  are  complex  constants.  (As  in  §2.3  and  §2.5,  the  physical 
solution  may  be  taken  to  be  the  real  part  of  (11.15).)  Such  solutions  are 
called  normal  modes  of  the  system. 

Substituting  (11.15)  into  (11.13),  we  obtain  a  set  of  n  simultaneous 
equations  for  the  n  amplitudes  An, 

n 

-^2Aa  =  -YJka0A[}.  (11.16) 

13=1 

Let  us  consider  first  the  case  n  =  2.  Then  these  equations  are 


hi 

ki2 

'a; 

~A{ 

kn 

k22_ 

A*. 

=  uj2 

A 

(11.17) 


This  is  what  is  known  as  an  eigenvalue  equation.  The  values  of  u>2  for  which 
non-zero  solutions  exist  are  called  the  eigenvalues  of  the  2x2  matrix  with 
elements  kap.  The  column  vector  formed  by  the  Aa  is  an  eigenvector  of 
the  matrix. 

The  equations  (11.17)  can  alternatively  be  written  as 

fcn  -  UJ2  fei2 
k21        k22  -  OJ2 

These  equations  have  a  non-zero  solution  if  and  only  if  the  determinant  of 
the  coefficient  matrix  vanishes,  i.e.,  if  and  only  if 

=  (fcn  -  LU2)(k22  -UJ2)  -  k\2  =  0.  (11.18) 


Ai 

0 

A2 

0 

fell  -  UJ2  ki2 

k2i      k22  -  u)2 
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This  is  called  the  characteristic  equation  for  the  system.  It  determines 
the  frequencies  u>  of  the  normal  modes,  which  are  the  square  roots  of  the 
eigenvalues  uj2. 

Equation  (11.18)  is  a  quadratic  equation  for  uj2.  Its  discriminant  may 
be  written  (ku  —  /C22)2  +  4A;22,  which  is  clearly  positive.  Hence  it  always 
has  two  real  roots.  The  condition  for  stability  is  that  both  roots  should  be 
positive.  A  negative  root,  say  —  j2,  would  yield  a  solution  of  the  form 


where  both  the  Aa  and  the  Ba  coefficients  constitute  eigenvectors  of  the 
2x2  matrix,  corresponding  to  the  eigenvalue  — 72.  Except  in  the  degenerate 
case  of  two  equal  eigenvalues,  this  means  that  the  Aa  and  Ba  coefficients 
must  be  proportional,  since  the  eigenvector  is  unique  up  to  an  overall  factor. 
In  general,  therefore,  the  solution  yields  an  exponential  increase  in  the 
displacements  with  time. 

This  stability  condition,  that  all  the  eigenvalues  be  positive,  is  a  natural 
generalization  of  the  requirement,  in  the  one-dimensional  case,  that  the 
second  derivative  of  the  potential  energy  function  be  positive  (see  §2.2). 

If  uj2  is  chosen  equal  to  one  of  the  two  roots  of  (11.18),  then  either  of 
the  two  equations  in  (11.17)  fixes  the  ratio  A1/A2.  Since  the  coefficients 
are  real  numbers,  the  ratio  is  obviously  real.  (This  is  a  special  case  of  a 
general  theorem  about  symmetric  matrices  —  those  satisfying  kpa  =  kap. 
See  §A.10.)  This  means  that  A\  and  A2  have  the  same  phase  (or  phases 
differing  by  ir),  so  that  q\  and  q2  not  only  oscillate  with  the  same  frequency, 
but  actually  in  (or  directly  out  of)  phase.  The  ratio  of  qi  to  q2  remains 
fixed  throughout  the  motion. 

There  remains  in  A\  and  A2  a  common  arbitrary  complex  factor,  which 
serves  to  fix  the  overall  amplitude  and  phase  of  the  normal  mode  solution. 
Thus  each  normal  mode  solution  contains  two  arbitrary  real  constants. 

Since  the  equations  of  motion  (11.10)  are  linear,  any  superposition  of 
solutions  is  again  a  solution.  Hence  the  general  solution  is  simply  a  super- 
position of  the  two  normal  mode  solutions.  If  uj2  and  uj'2  are  the  roots  of 
(11.18),  it  may  be  written  as  the  real  part  of 


qa  = 


qi  =  Aieiart  +  A'ieia/t, 
q2  =  A2eiult  +  A'2eiu't, 


(11.19) 


in  which  the  ratios  Ai/A2  and  A\fA2  are  fixed  by  (11.17). 
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Example:  Normal  modes  of  the  double  pendulum 

Find  the  normal  mode  frequencies  of  the  double  pendulum. 
Here  the  equations  (11.17)  may  be  written 


(M  +  m)g     mg  mg' 

ML      +  Wl  ~M~l 

_9  9 

I  I  ■ 


Ax 

Ax 

=  cu2 

Ay 

The  characteristic  equation  (11.18)  simplifies  to 
M  +  rn  ( g      —  1  2 


(M) 


2  M  +  mg2 
uj  +  ■ 


0. 


(11.20) 


(11.21) 


M     \L     U  M  LI 

The  roots  of  this  equation  determine  the  frequencies  of  the  two  normal 
modes. 


It  is  interesting  to  examine  special  limiting  cases.  First,  let  us  suppose 
that  the  upper  pendulum  is  very  heavy  (M  3>  m).  Then,  provided  that  I  is 
not  too  close  to  L,  the  two  roots,  with  the  corresponding  ratios  determined 
by  (11.20),  are,  approximately 

2  ~  £  ~  m  L 

W  ~  I '       Av  ~  Ml  -  V 


and 


9_ 
V 


Ax 
A-t, 


In  the  first  mode,  the  upper  pendulum  is  practically  stationary,  while  the 
lower  is  swinging  with  its  natural  frequency.  In  the  second  mode,  whose 
frequency  is  that  of  the  upper  pendulum,  the  amplitudes  are  comparable. 
At  the  other  extreme,  if  M  -C  m,  the  two  normal  modes  are 

o        9  Ax  ^  L 

L  +  V       Av  ~  L  +  V 


and 


MXLi)1  Ay 


rn  L  +  i 
M  L 


In  the  first  mode,  the  pendulums  swing  almost  like  a  single  rigid  pendulum 
of  length  L  +  I.  In  the  second,  the  lower  bob  remains  almost  stationary, 
while  the  upper  one  executes  a  very  rapid  oscillation. 
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The  normal  modes  of  a  system  with  n  degrees  of  freedom  may  be  found 
by  a  very  similar  method.  The  condition  for  consistency  of  the  simultaneous 
equations  (11.16)  is  again  that  the  determinant  of  the  coefficients  should 
vanish.  For  example,  for  n  =  3,  we  require 


This  is  a  cubic  equation  for  ui2.  It  can  be  proved  that  its  three  roots  are 
all  real  (see  §A.10).  As  before,  the  condition  for  stability  is  that  all  three 
roots  should  be  positive.  The  roots  then  determine  the  frequencies  of  the 
three  normal  modes. 

For  each  normal  mode,  the  ratios  of  the  amplitudes  are  fixed  by  the 
equations  (11.16).  As  in  the  case  n  =  2,  these  ratios  are  all  real,  so  that  in 
a  normal  mode  all  the  co-ordinates  oscillate  in  phase  (or  180°  out  of  phase). 
Each  normal  mode  solution  involves  just  two  arbitrary  real  constants,  and 
the  general  solution  is  a  superposition  of  all  the  normal  modes. 

11.4    Coupled  Oscillators 

One  often  encounters  examples  of  physical  systems  that  may  be  described  as 
two  (or  more)  harmonic  oscillators,  which  are  approximately  independent, 
but  with  some  kind  of  relatively  weak  coupling  between  the  two.  (As  a 
specific  example,  we  shall  consider  below  the  system  shown  in  Fig.  11.2, 
which  consists  of  a  pair  of  identical  pendulums  coupled  by  a  spring.) 

If  the  co-ordinate  q  of  a  harmonic  oscillator  is  normalized  so  that  T  = 
\q2 ,  then  V  =  \ui2q2 ,  where  u>  is  the  angular  frequency.  Hence  for  a  pair 
of  uncoupled  oscillators,  the  coefficients  in  (11.10)  are 


When  the  oscillators  are  weakly  coupled,  these  equalities  will  still  be  ap- 
proximately true,  so  that  in  particular  k\2  is  small  in  comparison  to  fcn 
and  &22-  Thus  from  (11.18)  it  is  clear  that  the  characteristic  frequencies 
of  the  system  are  given  by  u2  «  kn  and  uj2  ss  ^22;  as  one  might  expect, 
they  are  close  to  the  frequencies  of  the  uncoupled  oscillators.  Then  from 
(11.17)  we  see  that,  in  the  first  normal  mode,  the  ratio  A2/A1  is  approx- 
imately kn/{kn  —  £22)-  Thus,  unless  the  frequencies  of  the  two  normal 
modes  are  nearly  equal,  the  normal  modes  differ  very  little  from  those  of 


fcn  -  LO2        fci2  fcl3 
&21        &22  -  U2  k%i 

hi         ki2     h:i  -  u2 


=  0. 


(11.22) 


fcl2  =  0 


262 


Classical  Mechanics 


the  uncoupled  system,  and  the  coupling  is  not  of  great  importance.  The 
interesting  case,  in  which  even  a  weak  coupling  can  be  important,  is  that 
in  which  the  two  frequencies  are  equal,  or  nearly  so. 

As  a  specific  example  of  this  case,  we  consider  a  pair  of  pendulums,  each 
of  mass  m  and  length  I,  coupled  by  a  weak  spring  (see  Fig.  11.2).  We  shall 


Fig.  11.2 


use  the  displacements  x  and  y  as  generalized  co-ordinates.  Then,  in  the 
absence  of  coupling,  the  potential  energy  is  approximately  imw^a;2  +  y2), 
where  u>q  =  g/l  gives  the  free  oscillation  frequency.  For  small  values  of  x 
and  y,  the  potential  energy  of  the  spring  has  the  form  \k(x  —  y)2.  It  will 
be  convenient  to  introduce  another  frequency,  ujs  defined  by  ui2  =  k/rn.  In 
fact,  ujs  is  the  angular  frequency  of  the  spring  if  one  end  is  held  fixed  and 
the  other  attached  to  a  mass  m.  Thus  we  take 


T  =  \m(x2  +  y2),  V  =  |m(^  +  lu2)(x2  +  y2)  -  muj2xy.  (11.23) 
The  normal  mode  equations  now  read 

[Al        .  [A  1 

(11.24) 


uj2  +  UJ2 


-ws2     uj2  +  ujI 


The  two  solutions  of  the  characteristic  equation  are  easily  seen  to  be 


uj2  =  loI  +  2uj2 


with       Ax/Ay  =  1, 
with       Ax/Ay  =  -1. 
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In  the  first  normal  mode,  the  two  pendulums  oscillate  together  in 
phase,  with  equal  amplitude  (see  Fig.  11.3(a)).  Since  the  spring  is  nei- 
ther expanded  nor  compressed  in  this  motion,  it  is  not  surprising  that  the 
frequency  is  just  that  of  the  uncoupled  pendulums.  In  the  second  normal 
mode,  which  has  a  somewhat  higher  frequency,  the  pendulums  swing  in 
opposite  directions,  alternately  expanding  and  compressing  the  spring  (see 
Fig.  11.3(b)). 


(b) 
Fig.  11.3 

The  general  solution  is  a  superposition  of  these  two  normal  modes,  and 
may  be  written  as  the  real  part  of 

x  =  AeiuJat  +  A'e1"'* 

11.25) 

y  =  Ae1Wot  -  A'e1"  *, 

where  +      .  The  constants  A  and  A'  may  be  determined  by  the 

initial  conditions. 

Example:  Motion  of  coupled  pendulums 

Find  the  solution  for  the  positions  of  the  bobs  if  the  system  is 
released  from  rest  with  one  bob  displaced  a  distance  a  from  its 
equilibrium  position.  Describe  the  motion. 
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Here,  at  t  =  0,  we  have  x  =  a,  y  =  0,  x  =  0,  y  =  0,  whence  we  find 
A  =  A'  =  a/2,  so  the  solution  is 

a;  =  ^acosuiot  +  §acosa/i  =  acosuj-t  cosui+t, 
y  =  ^acosujot  —  ^acosoj't  =  asinu>-t  smui+t, 

where  ui±  =  |(u/  ±  loq),  and  we  have  used  standard  trigonometric  iden- 
tities. 

Since  the  spring  is  weak,  ui'  is  only  slightly  greater  than  u>q,  and 
therefore  uj_  «C  w+.  We  have  beats  between  two  nearly  equal  frequen- 
cies. Thus  we  may  describe  the  motion  as  follows.  Initially,  the  first 
spring  swings  with  angular  frequency  cu+  and  gradually  decreasing  am- 
plitude, acosui-t.  Meanwhile,  the  second  pendulum  starts  to  swing  with 
the  same  angular  frequency,  but  90°  out  of  phase,  and  with  gradually  in- 
creasing amplitude  a  sin  uj-t.  After  a  time  n/2cu- ,  the  first  pendulum  has 
come  momentarily  to  rest,  and  the  second  is  oscillating  with  amplitude 
a.  Then  its  amplitude  starts  to  decrease,  while  the  first  increases  again. 
The  whole  process  is  then  repeated  indefinitely  (though  in  practice  there 
will  of  course  be  some  damping) . 

This  behaviour  should  be  contrasted  with  that  of  a  pair  of  coupled 
oscillators  of  very  different  frequencies.  In  such  a  case,  if  one  is  started 
oscillating,  one  of  the  two  normal  modes  will  have  a  much  larger  amplitude 
than  the  other.  Thus  only  a  very  small  oscillation  will  be  set  up  in  the 
second  oscillator,  and  the  amplitude  of  the  first  will  be  practically  constant. 


Normal  co-ordinates 

The  two  normal  modes  of  this  system  (or  the  n  normal  modes  in  the  general 
case)  are  completely  independent.  We  can  make  this  fact  explicit  by  intro- 
ducing new  'normal'  co-ordinates.  In  the  case  of  the  coupled  pendulums, 
we  introduce,  in  place  of  x  and  y,  the  new  co-ordinates 

-(x  +  y),        q2  =  ^-(x-y).  (11.26) 

In  terms  of  these  co-ordinates,  the  solution  (11.25)  is 

qx  =  AieiuJat,        Ax  =  y/2^A, 

.  ,t  ,   11.27 

q2  =  A2elu  *,        A2  =  VW'. 
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Thus,  in  each  normal  mode,  one  co-ordinate  only  is  oscillating.  Co- 
ordinates with  this  property  are  called  normal  co-ordinates. 

The  independence  of  the  two  normal  co-ordinates  may  also  be  seen 
by  examining  the  Lagrangian  function.  From  (11.23)  and  (11.26),  we 
find 

T=\q\  +  \ql        V  =  §^(<?2  +  ql)  +  ^ 
whence  the  Lagrangian  function  is 

L  =  m-^ll)  +  Wl-^lD-  (11-28) 

In  effect,  we  have  reduced  the  Lagrangian  to  that  for  a  pair  of  uncoupled 
oscillators,  with  angular  frequencies  luq  and  a/. 

The  normal  co-ordinates  are  very  useful  in  studying  the  effect  on  the 
system  of  a  prescribed  external  force. 


Example:  Forced  oscillation  of  coupled  pendulums 

Find  the  amplitudes  of  forced  oscillations  of  the  coupled  pen- 
dulums if  one  of  them  is  subjected  to  a  periodic  force  F(t)  = 
Fi  cosaJii. 

We  write  the  force  as  the  real  part  of  Fie1"1'.  To  find  the  equations 
of  motion  in  the  presence  of  this  force,  we  must  evaluate  the  work  done 
in  a  small  displacement  (see  §10.2).  This  is 

F(t)5x  =  ^L(dqi+5q2). 
•s/2rn 

Hence  the  equations  of  motion  are 

,  Fieiwit 
3i  =  -^o5i  + 


&t  (H-29) 

/2,     ,  rle 


q2  =  -uj'Aq2  + 


/2m 

These  independent  oscillator  equations  may  be  solved  exactly  as  in 
§2.6.  In  particular,  the  amplitudes  of  the  forced  oscillations  are  given 
by 

_  Fx/V^i  _  Fxj^hn 
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The  corresponding  forced  oscillation  amplitudes  for  x  and  y  are  found 
by  solving  (11.26),  and  are 

_  Ai  +  A2  _A1-A2 

Ay 


V2m  V2m 
Note  that  if  the  forcing  frequency  is  very  close  to  luq,  the  first  normal 
mode  will  predominate,  and  the  pendulums  will  swing  in  the  same  di- 
rection; while  if  it  is  close  to  w'  the  second  will  be  more  important.  (Of 
course,  we  should  really  include  the  effects  of  damping,  so  that  the  am- 
plitudes do  not  become  infinite  at  resonance,  and  so  that  transient  effects 
disappear  in  time.) 


11.5    Oscillations  of  Particles  on  a  String 

Consider  a  light  string  of  length  (n  +  1)/,  stretched  to  a  tension  F,  with 
n  equal  masses  m  placed  along  it  at  regular  intervals  I.  We  shall  con- 
sider transverse  oscillations  of  the  particles,  and  use  as  our  generalized 
co-ordinates  the  displacements  yi,  yi,  ■  ■  ■ ,  Dn-  (See  Fig.  11.4,  drawn  for  the 
case  n  =  3.)  Since  the  kinetic  energy  is 


/  /  /  / 


Fig.  11.4 


T=\m{yl+yl  + ■■■  +  £),  (11.30) 

these  co-ordinates  are  orthogonal. 

Next,  we  must  calculate  the  potential  energy.  Let  us  consider  the  length 
of  string  between  the  jth  and  (j  +  l)th  particles.  In  equilibrium,  its  length 
is  I,  but  when  the  particles  are  displaced  it  is 

l  +  Sl=yJl^  +  (yj+1-yj)^l 

assuming  the  displacements  are  small.  This  calculation  also  applies  to  the 
segments  of  string  at  each  end,  provided  we  set  yo  =  y-n+i  =  0.  The  work 
done  against  the  tension  in  increasing  the  length  of  the  string  to  this  extent 


1  + 


(Vi+i  ~  ViY 


2P 
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is  FSl.  Hence,  adding  the  contributions  from  each  segment  of  string,  we 
find  that  the  potential  energy  is 


F 


V  =  2?  [vl  +  (2/2  -  yif  +  •••  +  (!/„-  yn-if  +  yn]  ■ 


(11.31) 


It  is  worth  noting  that  the  potential  energy  of  a  continuous  string  may  be 
obtained  as  a  limiting  case,  as  n  — >  00  and  I  — >  0.  For  small  Z,  (yj+i—yj)2 /I2 
is  approximately  y'2,  so  we  recover  the  expression  (10.32). 

From  (11.30)  and  (11.31),  we  find  that  Lagrange's  equations  are 


2/i 

2/2 


ml 


(-2yi+y2), 


—^yi  -  22/2  +  2/3), 


(11.32) 


Vn 


ml 


2y„). 


It  will  be  convenient  to  write  u>q  =  F/ml.  Then,  substituting  the  normal 
mode  solution  yj  =  Aje1^,  we  obtain  the  equations 


2ui2  -ojI  0 
-uj2  2uj2  -ul 
0    -lj2  2uj2 


0  0 


0    ...  2ujI 


-Ai 

-A{ 

A2 

A2 

A3 

=  UJ2 

A3 

.An^ 

.An_ 

(11.33) 


Let  us  look  at  the  first  few  values  of  n.  For  n  =  1,  there  is  of  course  just 
one  normal  mode,  with  uj2  =  2uSq.  For  n  =  2,  the  characteristic  equation  is 

(2uj2  -  uj2)2  -  w%  =  0, 

and  we  obtain  two  normal  modes: 


UJ 


OJ, 


()■ 


A1/A2 


3wg,       A1/A2  =  -1. 


Now  consider  n  =  3.  The  characteristic  equation  here  is 


2wg  -  u)z 

-OJ2 


0 


2uj2 


-oj2     2uJq  -  ^ 


0. 


(11.34) 
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Expanding  this  determinant  by  the  usual  rules,  we  obtain  a  cubic  equation 
for  uj2: 

(2ujI  -  uj2f  -  2w£(2u;g  -  uj2)  =  0. 

The  roots  of  this  equation  are  2u>2  and  (2  ±  ^/2)u>q.  Hence  we  obtain  the 
three  normal  modes 

uj2  =  (2  -  ^2)uj2,  A1:A2:A3  =  l:y/2:l; 
uj2  =  2oj27  Ax  :  A2  :  A3  =  1  :  0  :  -1; 

uj2  =  (2  +  y/2)uj2,       A1  :  A2  :  A3  =  1  :  -y/2  :  1. 

These  normal  modes  are  illustrated  in  Fig.  11.5. 


Fig.  11.5 

Higher  values  of  n  may  be  treated  similarly.  For  n  =  4,  the  character- 
istic equation  requires  the  vanishing  of  a  4  x  4  determinant,  which  may  be 
expanded  by  similar  rules  to  yield 

{2uj2  -  uj2)4  -  3oj4(2iv%  -  uj2)2  +  uj%  =  0. 

The  roots  of  this  equation  are  given  by 
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Thus  we  obtain  the  four  normal  modes: 


=  0.38  ui2, 

Ax 

A2 

As 

A4 

=  1  :  1.62  : 

1.62  :  1; 

lj2 

=  1.38  U!2, 

Ax 

A2 

A:i 

Ai 

=  1.62  :  1  : 

-1  :  -1.62; 

lj2 

=  2.62  c;2, 

Ax 

A2 

A3 

Ai 

=  1.62  :  -1 

:  -1  :  1.62; 

LO2 

=  3.62  w2, 

Ax 

A2 

A3 

Ai 

=  1  :  -1.62 

:  1.62  :  -1. 

(See  Fig.  11.6.) 


Fig.  11.6 

For  every  value  of  n,  the  slowest  mode  is  the  one  in  which  all  the  masses 
are  oscillating  in  the  same  direction,  while  the  fastest  is  one  in  which  al- 
ternate masses  oscillate  in  opposite  directions.  For  large  values  of  n,  the 
normal  modes  approach  those  of  a  continuous  stretched  string,  which  we 
discuss  in  the  following  section. 

11.6    Normal  Modes  of  a  Stretched  String 

We  now  wish  to  discuss  the  problem  treated  in  §10.6  from  the  point  of  view 
of  normal  modes.  We  start  from  the  equations  of  motion,  (10.36), 

y  =  c2y",       c2  =  F/n,  (11.35) 
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and  look  for  normal  mode  solutions  of  the  form 

y(x,t)  =  A(x)eiujt.  (11.36) 

Substituting  in  (11.35),  we  obtain 

A"(x)  +  k2A(x)  =  0,        k  =  u/c. 

Tims,  in  place  of  a  set  of  simultaneous  equations  for  the  amplitudes  Aj,  we 
obtain  a  differential  equation  for  the  amplitude  function  A(x). 
The  general  solution  of  this  equation  is 

A(x)  =  a  cos  kx  +  b  sin  kx. 

However,  because  the  ends  of  the  string  are  fixed,  we  must  impose  the 
boundary  conditions  A(0)  =  A(l)  =  0.  (Note  that  I  here  is  the  full  length 
of  the  string,  denoted  by  (re  +  1)/  in  the  preceding  section.)  Thus  a  =  0 
and  moreover  sin  kl  =  0.  The  possible  values  of  k  are 

71  7T 

fc=— ,       n  =  l,2,3,....  (11.37) 

Each  of  these  values  corresponds  to  a  normal  mode  of  the  string.  The 
corresponding  angular  frequencies  are 

T17TC 

w=—,       n  =  l,2,3,....  (11.38) 
Note  that  they  are  all  multiples  of  the  fundamental  frequency 

1YC 

-1  =  -=^ 

where  M  is  the  total  mass  of  the  string. 

The  solution  for  the  nth  normal  mode  can  be  written  as 

y(x,  t)  =  Re  (Aneimrct/i)  sin  (11.39) 

where  An  is  an  arbitrary  complex  constant.  It  represents  a  'standing  wave' 
of  wavelength  21 /n,  with  re  —  1  nodes,  or  points  where  y  =  0.  The  first  few 
normal  modes  are  illustrated  in  Fig.  11.7. 

The  general  solution  for  the  stretched  string  is  a  superposition  of  all 
the  normal  modes  (11.39).  It  is  easy  to  establish  the  connection  with  the 
general  solution  (10.37)  obtained  in  the  previous  chapter.   According  to 
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«=  1 


n  =  2 


n  =  3 


n  =  4 


Fig.  11.7 


(10.38),  f(x)  is  a  periodic  function  of  x,  with  period  21.  Hence  it  may  be 
expanded  in  a  Fourier  series  (see  Eq.  (2.45)), 


Thus  the  solution  (10.37)  is 


f(x)=  fn^X/l- 
n=  —  oo 


+oo 


n=  —  oo 


e^/'sin^. 


Now  recall  that  /„  and  /_„  must  be  complex  conjugates  (so  that  the  two 
terms  add  to  give  a  real  contribution  to  y).  Thus  we  may  restrict  the  sum 
to  positive  values  of  n.  (Note  that  there  is  no  contribution  from  n  =  0 
because  the  sine  function  then  vanishes.)  If  we  define  An  =  2i/„,  we  can 
write  the  solution  as 


y(x,  t)  =  2Re  £  Ane™ct/1  sin 


71=1 
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11.7  Summary 

Near  a  position  of  equilibrium  of  any  natural,  conservative  system,  the  ki- 
netic energy  may  be  taken  to  be  a  homogeneous  quadratic  function  of  the 
qa,  with  constant  coefficients,  and  the  potential  energy  to  be  a  homoge- 
neous quadratic  function  of  the  qa .  We  can  always  find  a  set  of  orthogonal 
co-ordinates,  in  terms  of  which  T  is  reduced  to  a  sum  of  squares.  Lagrange's 
equations  then  take  on  a  simple  form.  To  find  the  normal  modes  of  oscilla- 
tion, we  substitute  solutions  of  the  form  qa  =  Aaelujt ,  and  obtain  a  set  of 
simultaneous  linear  equations  for  the  coefficients.  The  condition  for  con- 
sistency of  these  equations  is  the  characteristic  equation,  which  determines 
the  frequencies  of  the  normal  modes.  The  stability  condition  is  that  all  the 
roots  of  this  equation  for  ui2  should  be  positive. 

The  problem  of  finding  the  normal  modes  is  equivalent  to  that  of  find- 
ing normal  co-ordinates,  which  reduce  not  only  T  but  also  V  to  a  sum  of 
squares.  In  terms  of  normal  co-ordinates,  the  system  is  reduced  to  a  set 
of  uncoupled  harmonic  oscillators,  whose  frequencies  are  the  characteristic 
frequencies  of  the  system.  The  general  solution  to  the  equations  of  motion 
is  a  superposition  of  all  the  normal  modes.  In  it,  each  normal  co-ordinate  is 
oscillating  at  its  own  frequency,  and  with  amplitude  and  phase  determined 
by  the  initial  conditions. 

The  linearized  analysis  of  small  amplitude  oscillations  near  to  a  position 
of  stable  equilibrium  in  the  form  of  normal  modes  is  a  technique  which  is 
applicable  generally.  For  some  systems,  which  are  special  but  important, 
the  idea  of  a  normal  mode  may  be  generalized.  Such  systems  may  then 
be  analyzed  as  a  combination  of  'nonlinear'  normal  modes,  where  no  small 
amplitude  approximation  needs  to  be  made  —  see  §14.1. 


Problems 

1.  A  double  pendulum,  consisting  of  a  pair,  each  of  mass  m  and  length 
I,  is  released  from  rest  with  the  pendulums  displaced  but  in  a  straight 
line.  Find  the  displacements  of  the  pendulums  as  functions  of  time. 

2.  Find  the  normal  modes  of  a  pair  of  coupled  pendulums  (like  those  of 
Fig.  11.2)  if  the  two  are  of  different  masses  M  and  m,  but  still  the  same 
length  I.  Given  that  the  pendulum  of  mass  M  is  started  oscillating  with 
amplitude  a,  find  the  maximum  amplitude  of  the  other  pendulum  in 
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the  subsequent  motion.  Does  the  amplitude  of  the  first  pendulum  ever 
fall  to  zero? 

3.  A  spring  of  negligible  mass,  and  spring  constant  (force/extension)  k, 
supports  a  mass  m,  and  beneath  it  a  second,  identical  spring,  carry- 
ing a  second,  identical  mass.  Using  the  vertical  displacements  x  and 
y  of  the  masses  from  their  positions  with  the  springs  unextended  as 
generalized  co-ordinates,  write  down  the  Lagrangian  function.  Find 
the  position  of  equilibrium,  and  the  normal  modes  and  frequencies  of 
vertical  oscillations. 

4.  Three  identical  pendulums  are  coupled,  as  in  Fig.  11.2,  with  springs 
between  the  first  and  second  and  between  the  second  and  third.  Find 
the  frequencies  of  the  normal  modes,  and  the  ratios  of  the  amplitudes. 

5.  The  first  of  the  three  pendulums  of  Problem  4  is  initially  displaced  a 
distance  a,  while  the  other  two  are  vertical.  The  system  is  released  from 
rest.  Find  the  maximum  amplitudes  of  the  second  and  third  pendulums 
in  the  subsequent  motion. 

6.  Three  identical  springs,  of  negligible  mass,  spring  constant  k,  and  nat- 
ural length  a  are  attached  end-to-end,  and  a  pair  of  particles,  each 
of  mass  m,  are  fixed  to  the  points  where  they  meet.  The  system  is 
stretched  between  fixed  points  a  distance  3Z  apart  (I  >  a).  Find  the 
frequencies  of  normal  modes  of  (a)  longitudinal,  and  (b)  transverse 
oscillations. 

7.  *A  bead  of  mass  m  slides  on  a  smooth  circular  hoop  of  mass  M  and 
radius  a,  which  is  pivoted  at  a  point  on  its  rim  so  that  it  can  swing 
freely  in  its  plane.  Write  down  the  Lagrangian  in  terms  of  the  angle  of 
inclination  9  of  the  diameter  through  the  pivot  and  the  angular  position 
ip  of  the  bead  relative  to  a  fixed  point  on  the  hoop.  Find  the  frequencies 
of  the  normal  modes,  and  sketch  the  configuration  of  hoop  and  bead  at 
the  extreme  point  of  each. 

8.  *The  system  of  Problem  7  is  released  from  rest  with  the  centre  of  the 
hoop  vertically  below  the  pivot  and  the  bead  displaced  by  a  small  angle 
<Pq.  Given  that  M  =  8m  and  that  2a  is  the  length  of  a  simple  pendulum 
of  period  1  s,  find  the  angular  displacement  6  of  the  hoop  as  a  function 
of  time.  Determine  the  maximum  value  of  9  in  the  subsequent  motion, 
and  the  time  at  which  it  first  occurs. 

9.  A  simple  pendulum  of  mass  m,  whose  period  when  suspended  from  a 
rigid  support  is  1  s,  hangs  from  a  supporting  block  of  mass  2m  which 
can  move  along  a  horizontal  line  (in  the  plane  of  the  pendulum),  and 
is  restricted  by  a  harmonic-oscillator  restoring  force.  The  period  of  the 
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oscillator  (with  the  pendulum  removed)  is  0.1  s.  Find  the  periods  of  the 
two  normal  modes.  When  the  pendulum  bob  is  swinging  in  the  slower 
mode  with  amplitude  100  mm,  what  is  the  amplitude  of  the  motion  of 
the  supporting  block? 

10.  *The  system  of  Problem  9  is  initially  at  rest,  and  the  pendulum  bob  is 
given  an  impulsive  blow  which  starts  it  moving  with  velocity  0.5ms"1. 
Find  the  position  of  the  support  as  a  function  of  time  in  the  subsequent 
motion. 

11.  *A  particle  of  charge  q  and  mass  m  is  free  to  slide  on  a  smooth  hor- 
izontal table.  Two  fixed  charges  q  are  placed  at  ±aj,  and  two  fixed 
charges  I2q  at  ±2ai.  Find  the  electrostatic  potential  near  the  origin 
(see  §6.2).  Show  that  this  is  a  position  of  stable  equilibrium,  and  find 
the  frequencies  of  the  normal  modes  of  oscillation  near  it. 

12.  *A  rigid  rod  of  length  2a  is  suspended  by  two  light,  inextensible  strings 
of  length  I  joining  its  ends  to  supports  also  a  distance  2a  apart  and 
level  with  each  other.  Using  the  longitudinal  displacement  x  of  the 
centre  of  the  rod,  and  the  transverse  displacements  2/1,2/2  of  its  ends, 
as  generalized  co-ordinates,  find  the  Lagrangian  function  (for  small 
£,2/1)2/2)-  Determine  the  normal  modes  and  frequencies.  (Hint:  First 
find  the  height  by  which  each  end  is  raised,  the  co-ordinates  of  the 
centre  of  mass  and  the  angle  through  which  the  rod  is  turned.) 

13.  *Each  of  the  pendulums  in  Fig  11.2  is  subjected  to  a  damping  force,  of 
magnitude  ax  and  ay  respectively,  while  there  is  a  damping  force  f3(x  — 
y)  in  the  spring.  Show  that  the  equations  for  the  normal  co-ordinates  q\ 
and  t/2  are  still  uncoupled.  Find  the  amplitudes  of  the  forced  oscillations 
obtained  by  applying  a  periodic  force  to  one  pendulum.  Given  that  the 
forcing  frequency  is  that  of  the  uncoupled  pendulums,  and  that  (3  is 
negligible,  find  the  range  of  values  of  a  for  which  the  amplitude  of  the 
second  pendulum  is  less  than  half  that  of  the  first. 

14.  *Show  that  a  stretched  string  is  equivalent  mathematically  to  an  infinite 
number  of  uncoupled  oscillators,  described  by  the  co-ordinates 


Determine  the  amplitudes  of  the  various  normal  modes  in  the  motion 
described  in  Chapter  10,  Problem  14.  Why,  physically,  are  the  modes 
for  even  values  of  n  not  excited? 
15.  Show  that  a  typical  equation  of  the  set  (11.33)  may  be  satisfied  by 
setting  Aa  =  sinafc  (a  =  1,2, ...  ,n),  provided  that  w  =  2wosin|fc. 
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Hence  show  by  considering  the  required  condition  when  a  =  n  +  1 
that  the  frequencies  of  the  normal  modes  are  uir  =  2ojq  sin[r7r/2(n+  1)], 
with  r  =  1,2, . . .  ,n.  Why  may  we  ignore  values  of  r  greater  than  n  +  1? 
Show  that,  in  the  limit  of  large  n,  the  frequency  of  the  rth  normal  mode 
tends  to  the  corresponding  frequency  of  the  continuous  string  with  the 
same  total  length  and  mass. 
16.  A  particle  moves  under  a  conservative  force  with  potential  energy  V(r). 
The  point  r  =  0  is  a  position  of  equilibrium,  and  the  axes  are  so  chosen 
that  x,  y,  z  are  normal  co-ordinates.  Show  that,  if  V  satisfies  Laplace's 
equation,  W2V  =  0  (see  §6.7),  then  the  equilibrium  is  necessarily  unsta- 
ble, and  hence  that  stable  equilibrium  under  purely  gravitational  and 
electrostatic  forces  is  impossible.  (Of  course,  dynamic  equilibrium  — 
stable  periodic  motion  —  can  occur.  Note  also  that  the  two-dimensional 
stable  equilibrium  of  Problem  11  does  not  contradict  this  result  because 
there  is  another  force  imposed,  confining  the  charge  to  the  horizontal 
plane.) 
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Chapter  12 

Hamiltonian  Mechanics 


We  have  already  seen,  in  several  examples,  the  value  of  the  Lagrangian 
method,  which  allows  us  to  find  equations  of  motion  for  any  system  in 
terms  of  an  arbitrary  set  of  generalized  co-ordinates.  In  this  chapter,  we 
shall  discuss  an  extension  of  the  method,  due  to  Hamilton.  Its  principal 
feature  is  the  use  of  the  generalized  momenta  pi,p2,  ■  ■  ■  ,Pn  in  place  of 
the  generalized  velocities  q±,  .  ■  ■ ,  qn-  It  is  particularly  valuable  when, 
as  often  happens,  some  of  the  generalized  momenta  are  constants  of  the 
motion.  More  generally,  it  is  well  suited  to  finding  conserved  quantities, 
and  making  use  of  them. 


12.1    Hamilton's  Equations 

The  Lagrangian  function  L  for  a  natural  system  is  a  function  of  qi,  qn,  •  •  ■ ,  <Zn 
and  qi,q2, . . .  ,q„.  For  brevity,  we  shall  indicate  this  dependence  by  writing 
L(q,  q),  where  q  stands  for  all  the  generalized  coordinates,  and  q  for  all  their 
time  derivatives. 

Lagrange's  equations  may  be  written  in  the  form 

Pa  =  (12.1) 

dqa 

where  the  generalized  momenta  are  defined  by 

dL 

Pa  =  12.2 

dqa 

Here  and  in  the  following  equations,  a  runs  over  1,2, ...  ,n. 

The  instantaneous  position  and  velocity  of  every  part  of  our  system 
may  be  specified  by  the  values  of  the  2n  variables  q  and  q.  However,  we 


277 


278 


Classical  Mechanics 


can  alternatively  solve  the  equations  (12.2)  for  the  q  in  terms  of  q  and  p, 
obtaining,  say, 

qa  =  qa(q,p),  (12.3) 

so  the  2n  variables  q  and  p  also  serve  equally  well  to  specify  the  instanta- 
neous position  and  velocity  of  every  particle. 

For  example,  for  a  particle  moving  in  a  plane,  and  described  by  polar  co- 
ordinates, the  generalized  momenta  are  given  by  pr  =  mr  and  pg  =  mr20. 
In  this  case,  Eqs.  (12.3)  read 

.  =  £r  §=PS_  (124) 

m  mrz 

The  instantaneous  position  and  velocity  of  the  particle  may  be  fixed  by  the 
values  of  r,  6,pr  and  pg. 

We  now  define  a  function  of  q  and  p,  the  Hamiltonian  function,  by 

n 

H(q,p)  =  ^2pf3qp{q,p)  ~  L(q,q(q,p)).  (12.5) 
p=i 

Next,  we  compute  the  derivatives  of  H  with  respect  to  its  2n  independent 
variables.  We  differentiate  first  with  respect  to  pa.  One  term  in  this  deriva- 
tive is  the  coefficient  of  pa  in  the  sum  ^pq,  namely  qa.  Other  terms  arise 
from  the  dependence  of  each  qp,  either  in  the  sum  or  as  an  argument  of  L, 
on  pa.  Altogether,  we  obtain 

dH_  _  .  ^  dqp  ^  dL  dqp 
dpa-qa  +  ^P0dpa  fedqpdp* 

Now,  by  (12.2),  the  second  and  third  terms  cancel.  Hence  we  are  left  with 

r)  H 

jr~  =  ?«•  (12.6) 
dpa 

We  now  examine  the  derivative  with  respect  to  qa.  Again,  there  are  two 
kinds  of  terms,  the  term  coming  from  the  explicit  dependence  of  L  on  qai 
and  those  from  the  dependence  of  each  qp  on  qa.  We  find 
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As  before,  the  second  and  third  terms  cancel.  Thus,  using  Lagrange's 
equations  (12.1),  we  obtain 

flH 

Wa  =  (12'7) 

The  equations  (12.6)  and  (12.7)  together  constitute  Hamilton's  equa- 
tions. Note  that  whereas  Lagrange's  equations  are  a  set  of  n  second-order 
differential  equations,  Hamilton's  constitute  a  set  of  2n  first-order  equa- 
tions. 

Let  us  consider,  for  example,  a  particle  moving  in  a  plane  under  a  cen- 
tral, conservative  force. 


Example:  Central  conservative  force 

Obtain  Hamilton's  equations  for  a  particle  moving  in  a  plane 
with  potential  energy  function  V(r). 

Here  the  Lagrangian  is 

L  =  \mr2  +  \mr2d2  -  V(r). 

Thus  the  Hamiltonian  function  is 

H  =  {prr+peO)  -  \\mr2  +  \mr262  -  V(r)], 

or,  using  (12.4)  to  eliminate  the  velocities, 

Note  that  this  is  the  expression  for  the  total  energy,  T  +  V .  This  is 
no  accident,  but  a  general  property  of  natural  systems,  as  we  shall  see 
below. 

The  first  pair  of  Hamilton's  equation,  (12.6),  are 

r=^  =  P~,       0=^=*L.  (12.9) 
opr      m  opg  mrz 

They  simply  reproduce  the  relations  (12.4)  between  velocities  and  mo- 
menta. The  second  pair,  (12.7),  are 

dH         pi      dV  dH     n  ,  n  r, 

-Pr  =  IT  =  -3  +  -7-'  -P0  =  -^  =  °-  12'10 

Or        mr6      dr  aO 
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The  second  of  these  two  equations  yields  the  law  of  conservation  of  an- 
gular momentum, 

pe  =  J  =  constant.  (12.11) 

The  first  gives  the  radial  equation  of  motion, 

..      J2  dV 
pr  =  mr=   ;r  -  —. 

It  may  be  integrated  to  give  what  we  termed  the  'radial  energy  equation' 
(4.12)  in  Chapter  4. 

12.2     Conservation  of  Energy 

We  saw  in  §10.1  that  a  natural  system  is  characterized  by  the  fact  that 
the  kinetic  energy  contains  no  explicit  dependence  on  the  time,  and  is 
a  homogeneous  quadratic  function  of  the  time  derivatives  q.  This  latter 
condition  may  be  expressed  algebraically  by  the  equation 

For  example,  for  n  =  2,  T  has  the  form  (11.1).  Thus 
dT  dT 

TT^qi  +  777-92  =  (fflll^l  +  ai2<?2)<jl  +  ((12191  +  a22<?2)<22  =  IT. 

oqi  dq2 
Since  pa  =  dT/dqa,  we  therefore  have 

(3=1  0=1  q/3 

=  2T  -  (T  -  V)  =  T  +  V. 

Thus,  for  a  natural  system,  the  value  of  the  Hamiltonian  function  is  equal 
to  the  total  energy  of  the  system. 

For  a  forced  system,  the  Lagrangian  can  sometimes  be  written,  as  we  saw 
in  §10.4,  in  the  form  L  =  T'  —  V' ,  where  T'  is  a  homogeneous  quadratic 
in  the  variables  q,  and  V  is  independent  of  them.  In  such  a  case,  the 
Hamiltonian  function  is  equal  to  T'  +  V,  which  is  in  general  not  the  total 
energy. 

Now  let  us  examine  the  time  derivative  of  H .  We  shall  now  allow  for  the 
possibility  that  H  may  contain  an  explicit  time  dependence  (as  it  does  for 
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some  forced  systems),  and  write  H  =  H(q,  p,  t).  Then  the  value  of  H  varies 
with  time  for  two  reasons:  firstly,  because  of  its  explicit  time  dependence, 
and,  secondly,  because  the  variables  q  and  p  are  themselves  functions  of 
time.  Thus  the  total  time  derivative  is 

dH  _  dH     ^  dH  .  Aaff, 

Now,  if  we  express  q  and  p  in  terms  of  derivatives  of  H,  using  Hamilton's 
equations  (12.6)  and  (12.7),  we  obtain 

dH  _  dH     ^  /  dH  dH     dH  dH  \ 
dt       dt      ~[  V  dQa  dpa      dpa  dqa  J  ' 

Obviously,  the  terms  in  parentheses  cancel,  whence 


dH  _  dH 

~dT  ~  ~dt' 


(12.12) 


This  equation  asserts  that  the  value  of  H  changes  in  time  only  because  of 
its  explicit  time  dependence.  The  net  change  induced  by  the  fact  that  q 
and  p  vary  with  time  is  zero. 

In  particular,  for  a  natural,  conservative  system,  neither  T  nor  V  con- 
tains any  explicit  dependence  on  the  time.  Thus  dH/dt  =  0,  and  it  follows 
that 

f=0.  (1,13) 
Thus  there  is  a  law  of  conservation  of  energy, 

H  =  T  +  V  =  E  =  constant.  (12.14) 

In  a  forced  system,  if  H  =  T'  +  V' ,  and  is  time-independent,  we  again 
have  a  conservation  law  —  like  (10.20)  —  though  not  for  T  +  V. 

Even  when  H  is  not  of  the  form  T'  +  V ,  an  energy  conservation  law 
may  exist.  The  prime  example  of  this  is  that  of  a  charged  particle  moving 
in  static  electric  and  magnetic  fields  E  and  B.  Since  the  magnetic  force 
is  perpendicular  to  r,  it  does  no  work,  and  the  sum  of  the  kinetic  and 
electrostatic  potential  energies  is  a  constant.  It  is  interesting  to  see  how 
this  emerges  from  the  Hamiltonian  formalism. 
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Example:  Particle  in  electric  and  magnetic  fields 

Find  the  Hamiltonian  for  a  charged  particle  in  electric  and  mag- 
netic fields.  Show  that,  if  the  fields  are  time-independent,  there 
is  an  energy  conservation  law. 

We  begin  with  the  Lagrangian  (10.27): 

L  =  ^mr2  +  qr  ■  A(r,  t)  -  q<f>(r,  t), 

from  which,  as  in  (10.28),  it  follows  that  the  generalized  momentum  is 
p  =  mr  +  qA.  Thus  we  find 


Now,  if  E  and  B  are  time-independent,  it  is  possible  to  choose  the 
scalar  and  vector  potentials  4>  and  A  also  to  be  so  (see  §A.7).  Thus  H 
has  no  explicit  time-dependence,  and  therefore  is  conserved:  H  =  E  = 
constant. 

Note  that  in  this  case,  L  does  have  a  term  linear  in  f:  it  has  the 
form  L  =  L2  +  Li  +  Lq,  where  is  of  degree  k  in  f.  In  such  a  case,  it 
turns  out  that  H  =  L2  —  Lq.  The  'magnetic'  term  L\  apparently  does 
not  appear.  In  fact,  B  appears  in  H  only  via  the  relation  between  r  and 
P- 

The  Hamiltonian  formalism  is  particularly  well  suited  to  finding  conser- 
vation laws,  or  constants  of  the  motion.  The  conservation  law  for  energy  is 
the  first  of  a  large  class  of  conservation  laws  which  we  shall  discuss  in  the 
following  sections. 

12.3    Ignorable  Co-ordinates 

It  sometimes  happens  that  one  of  the  generalized  co-ordinates,  say  qa,  does 
not  appear  in  the  Hamiltonian  function  (though  the  corresponding  momen- 
tum, pa,  does).  In  that  case,  the  co-ordinate  qa  is  said  to  be  ignorable  — 
for  a  reason  we  shall  explain  in  a  moment. 

For  an  ignorable  co-ordinate,  Hamilton's  equation  (12.7)  yields 


H  =  p  ■  r  —  L 


{p-qAf 


+  q<j>. 


(12.15) 


2m 


-Pa 


(12.16) 
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It  leads  immediately  to  a  conservation  law  for  the  corresponding  generalized 
momentum, 

pa  =  constant.  (12.17) 

For  example,  for  a  particle  moving  in  a  plane  under  a  central,  conservative 
force,  H  is  independent  of  the  angular  co-ordinate  9  and  we  therefore  have 
the  law  of  conservation  of  angular  momentum,  (12.11). 

The  term  'ignorable  co-ordinate'  means  just  what  it  says:  that  for  many 
purposes  we  can  ignore  the  co-ordinate  qa ,  and  treat  the  corresponding  pa 
simply  as  a  constant  appearing  in  the  Hamiltonian  function.  This  is,  in 
effect,  what  we  did  for  the  central  force  problem  in  Chapter  4.  Because  of 
the  conservation  law  for  angular  momentum,  we  were  able  to  deal  with  an 
effectively  one- dimensional  problem  involving  only  the  radial  co-ordinate  r. 
The  generalized  momentum  pe  =  J  was  simply  a  constant  appearing  in  the 
equation  of  motion  or  the  energy  conservation  equation. 

Let  us  re-examine  this  problem  from  the  Hamiltonian  point  of  view. 
Since  the  Hamiltonian  (12.8)  is  independent  of  6,  so  9  is  ignorable.  Thus, 
we  may  regard  (12.8)  as  the  Hamiltonian  for  a  system  with  one  degree  of 
freedom,  described  by  the  co-ordinate  r  and  its  corresponding  momentum 
pr,  in  which  a  constant  pg  appears.  It  is  identical  with  the  Hamiltonian  for  a 
particle  moving  in  one  dimension  under  a  conservative  force  with  potential 
energy  function 

f7(r)  =  2^  +  T/(r)-  (12A8) 

This  is  precisely  the  'effective  potential  energy  function'  of  (4.13). 
Hamilton's  equations  for  r  and  pr  are 

.  _dB_  _pr_  .   _dH  _dU 

dpr      m'  ^r      dr  dr 

To  solve  the  central  force  problem,  we  solve  first  this  one-dimensional  prob- 
lem (for  example,  using  its  energy  conservation  equation,  the  'radial  energy 
equation').  Our  solution  gives  us  complete  information  about  the  radial 
motion  —  it  gives  r  as  a  function  of  r,  and  therefore  r  as  a  function  of  t, 
by  integrating. 

Any  required  information  about  the  angular  part  of  the  motion  can  then 
be  found  from  the  remaining  pair  of  Hamilton's  equations,  one  of  which  is 
the  angular  momentum  conservation  equation,  pg  =  0,  while  the  other  gives 
9  in  terms  of  pg ,  in  the  form  9  =  pg/ mr2 .  Clearly,  though  we  did  not  then 
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introduce  the  Hamiltonian,  this  is  essentially  just  the  method  we  used  in 
Chapter  4. 

We  shall  use  the  same  method  in  the  following  section  to  discuss  the 
general  motion  of  a  symmetric  top. 

Example:  The  symmetric  top 

Show  that  for  the  symmetric  top  two  of  the  three  Euler  angles  are 
ignorable  co-ordinates,  and  find  the  'effective  potential  energy 
function'  for  the  remaining  co-ordinate. 

We  start  from  the  Lagrangian  function  (10.11): 

L  =  \h^2  sin2  9  +  \h62  +  \lz{i>  +  (pcose)2  -  MgRcosO. 
The  corresponding  generalized  momenta  are 

Pip  =  hp  sin2  9  +  Iz{4>  +  ipcos9)  cos 9, 
Pe  =  h0, 

Pi,  =  h(ip  +  ftcos9). 
Solving  these  equations  for  tp,  9,  ip,  we  obtain 

•  _  Pf  -  Pi>  cos  6 

*  ~      h  sin2  9  ' 

a-  Pe 

h         h  sin2  9 


(12.19) 


The  simplest  way  to  construct  the  Hamiltonian  function  is  to  use  the 
fact  that  H  =  T+V,  and  express  T  in  terms  of  the  generalized  momenta 
using  (12.19).  In  this  way,  we  find 

=  (Pp-p^os9f  +  ^  +  MgRcoS9.  (12.20) 

2/isin26»         2h     2/3        y  K  ' 

It  is  easy  to  verify  that  the  first  set  of  Hamilton's  equations,  (12.6), 
correctly  reproduce  (12.19). 

It  is  clear  that  the  two  co-ordinates  ip  and  ip  here  are  ignorable,  and 
there  are  two  corresponding  conservation  laws,  pv  =  constant  and  p^  = 
constant.  Thus  the  problem  can  be  reduced  to  that  of  a  system  with  one 
degree  of  freedom  only,  described  by  the  co-ordinate  9.  The  Hamiltonian 
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function  (12.20)  may  be  written 


H=^-  +  U(9), 


where  the  effective  potential  energy  function  U(9)  is 

(pv  ~Pj>  cos6>)2 

2h  sin2  9  2/3 


U(9)  =  ^  ~  +^  +  MgR  cos  9.  (12.21) 


12.4    General  Motion  of  the  Symmetric  Top 

Hamilton's  equations  for  9  and  pg  give 

This  is  obviously  a  rather  complicated  equation  to  solve.  However,  the 
qualitative  features  of  the  motion  can  be  found  from  the  energy  conservation 
equation, 

£i-  +  U(9)  =E  =  constant.  (12.23) 
2ii 

In  particular,  the  angles  9  at  which  9  =  0  are  given  by  the  equation  U(9)  = 
E,  and  the  motion  is  confined  to  the  region  where  U(9)  <  E. 

Now  let  us  examine  the  function  U{9).  We  exclude  for  the  moment  the 
special  case  where  pv  =  ±p^.  (We  return  to  this  important  special  case  in 
the  next  section.)  Then  it  is  clear  from  (12.21)  that  as  9  approaches  either 
0  or  7T,  U{9)  — >  +00.  Hence  it  has  roughly  the  form  shown  in  Fig.  12.1, 
with  a  minimum  at  some  value  of  9,  say  9q,  between  0  and  tv.  It  can  be 
shown  that  there  is  only  one  minimum  (see  Problem  7).  When  E  is  equal 
to  this  minimum  value,  we  have  an  'equilibrium'  situation,  and  9  remains 
fixed  at  6*o-  This  corresponds  to  steady  precession.  For  any  larger  value  of 
E,  the  angle  9  oscillates  between  a  minimum  9\  and  a  maximum  #2- 

It  is  not  hard  to  describe  the  motion  of  the  top.  We  note  that,  according 
to  (12.19),  the  angular  velocity  of  the  axis  about  the  vertical,  ip,  is  zero  when 
cos#  =  Pip/ Pip-  If  this  angle  lies  outside  the  range  between  9\  and  #2  (or 
if  \p<p/Pip\  >  1),  then  ip  never  vanishes  and  the  axis  precesses  round  the 
vertical  in  a  fixed  direction,  and  wobbles  up  and  down  between  9\  and  02- 
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o     9j        e0  e2         n  q 

Fig.  12.1 

This  motion  is  illustrated  in  Fig.  12.2,  which  shows  the  track  of  the  end  of 
the  axis  on  a  sphere. 


Fig.  12.2 

On  the  other  hand,  if  9\  <  arccos(pv/p^)  <  62,  the  axis  moves  in  loops, 
as  shown  in  Fig.  12.3.  The  angular  velocity  has  one  sign  near  the  top  of 
the  loop,  and  the  opposite  near  the  bottom. 

The  limiting  case  between  the  two  kinds  of  motion  occurs  when 
arccos(pv/p^)  =  6±.  Then  the  loops  shrink  to  cusps,  as  shown  in  Fig.  12.4. 
The  axis  of  the  top  comes  instantaneously  to  rest  at  the  top  of  each  loop. 
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Fig.  12.4 


This  kind  of  motion  will  occur  if  the  top  is  set  spinning  with  its  axis  ini- 
tially at  rest.  (It  is  impossible  to  have  cusped  motion  with  the  cusps  at  the 
bottom,  for  they  correspond  to  points  of  minimum  kinetic  energy,  and  the 
motion  must  always  be  below  such  points.  A  top  set  spinning  with  its  axis 
stationary  cannot  rise  without  increasing  its  energy.) 
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It  is  easy  to  observe  these  kinds  of  motion  using  a  small  gyroscope.  In 
practice,  because  of  frictional  effects,  the  type  of  motion  will  change  slowly 
with  time. 

Example:  Stability  of  a  vertical  top 

A  top  is  set  spinning  with  angular  velocity  103  with  its  axis  ver- 
tical. How  will  it  move? 

If  the  axis  of  the  top  passes  through  the  vertical,  9  =  0,  then  it  is 
clear  that  U (0)  must  be  finite.  This  is  possible  only  if  pv  =  p^,  and  both 
generalized  momenta  must  be  equal  to  I3W3.  (We  could  also  consider 
types  of  motion  for  which  the  axis  passes  through  the  downward  vertical, 
9  =  7r.  In  that  case,  we  require  pv  =  —p^,  =  —  I3UJ3.  The  treatment  is 
entirely  similar  —  but  less  interesting  because  a  downward-pointing  top 
is  always  stable.) 

If  we  set  Pip  =  p^  =  I3W3,  the  effective  potential  energy  function 
(12.21)  becomes 

1-2  2 

U{0)  =  -^-^t&n2  \6  +  \IzujI  +  MgRcosO,  (12.24) 

where  we  have  used  the  identity  (1  —  cos  9)/ sin  9  =  tan  \9. 

For  small  values  of  9,  we  may  expand  U(9),  and  retain  only  the  terms 
up  to  order  92 ,  obtaining 


U(9) 


\\hul  +  MgR)  +  \  (M-  -  MgR^j  9\  (12.25) 

Since  there  is  no  linear  term,  9  =  0  is  always  a  position  of  equilibrium. 
It  is  a  position  of  stable  equilibrium  if  U{9)  has  a  minimum  at  9  =  0, 
that  is,  if  the  coefficient  of  92  is  positive.  Thus  there  is  a  minimum  value 
of  0J3  for  which  the  vertical  top  is  stable,  given  by 

MyMgR 


u)z  =   -2  =  u)0,     say.  (12.26) 

i3 


If  the  top  is  set  spinning  with  angular  velocity  greater  than  this 
critical  value,  it  will  remain  vertical.  When  the  angular  velocity  falls 
below  the  critical  value  (as  it  eventually  will,  because  of  friction),  the 
top  will  begin  to  wobble.  The  energy  of  the  vertical  top  is 

E  =  \l3ul  +  MgR. 
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Thus  the  angles  at  which  6  =  0  are  given  by  U(9)  =  E  (see  Fig.  12.5), 
or,  from  (12.24),  by 


0  02         it  fl 


Fig.  12.5 

They  are  easily  seen  to  be  8\  =  0  and  82  =  2  arccos(a>3/w0).  Thus  if 
the  top  is  set  spinning  with  its  axis  vertical  and  almost  stationary,  with 
angular  velocity  less  than  the  critical  value  ujq,  it  will  oscillate  in  the 
subsequent  motion  between  the  vertical  and  the  angle  62-  Note  that 
62  increases  as  is  decreased,  and  tends  to  tt  as  ^3  approaches  zero. 
When  ui-j  =  0,  the  top  behaves  like  a  compound  pendulum,  and  swings 
in  a  circle  through  both  the  upward  and  downward  verticals. 

12.5    Liouville's  Theorem 

This  theorem  really  belongs  to  statistical  mechanics,  but  it  is  interesting 
to  consider  it  here  because  it  is  a  very  direct  consequence  of  Hamilton's 
equations. 

The  instantaneous  position  and  velocity  of  every  particle  in  our  system 
is  specified  by  the  2n  variables  (qi, . . . ,  qn,Pi,  ■  ■  ■  ,Pn)-  It  is  convenient  to 
think  of  these  as  co-ordinates  in  a  2n-dimensional  space,  called  the  phase 
space  of  the  system.  Symbolically,  we  may  write 


r  =  (q1,...,qn,p1,...,pn). 
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As  time  progresses,  the  changing  state  of  the  system  can  be  described  by 
a  curve  r(t)  in  the  phase  space.  Hamilton's  equations,  (12.6)  and  (12.7), 
prescribe  the  rates  of  change  (qi, . . . ,  qn,pi,  ■  ■  ■  ,Pn)-  These  may  be  regarded 
as  the  components  of  a  2n-dimensional  velocity  vector, 

v  =  r  =  (gi,...,«n,pi,...  ,pn). 

Now  suppose  that  we  have  a  large  number  of  copies  of  our  system, 
starting  out  with  slightly  different  initial  values  of  the  co-ordinates  and 
momenta.  For  example,  we  may  repeat  many  times  an  experiment  on  the 
same  system,  but  with  small  random  variations  in  the  initial  conditions. 
Each  copy  of  the  system  is  represented  by  a  point  in  the  phase  space, 
moving  according  to  Hamilton's  equations.  We  thus  have  a  swarm  of  points, 
occupying  some  volume  in  phase  space,  rather  like  the  particles  in  a  fluid. 

Liouville's  theorem  concerns  how  this  swarm  moves.  What  it  says  is 
a  very  simple,  but  very  remarkable,  result,  namely  that  the  representative 
points  in  phase  space  move  as  though  they  formed  an  incompressible  fluid. 
The  2n-dimensional  volume  occupied  by  the  swarm  does  not  change  with 
time,  though  of  course  its  shape  may,  and  usually  does,  change  in  very 
complicated  ways. 

To  prove  this,  we  need  to  apply  a  generalization  of  the  divergence  oper- 
ation. In  three  dimensions,  it  is  shown  in  Appendix  A  (see  (A. 24))  that  the 
fluid  velocity  in  an  incompressible  fluid  satisfies  the  condition  V  •  v  =  0.  It 
is  easy  to  see  that  the  argument  generalizes  to  any  number  of  dimensions. 
The  condition  that  the  phase-space  volume  does  not  change  with  time  in 
the  flow  described  by  the  velocity  field  v  is  simply  that  the  2n-dimensional 
divergence  of  v  is  zero,  i.e., 

V.*  =  ^  +  ...  +  ^  +  ^...  +  ^=0.  (12.27) 
c%  dqn      dpi  dpn 

But,  by  (12.6)  and  (12.7),  the  1st  and  (n+l)st  terms  are 

dqi     dpi  _  _d_  fdH_\  +  _d_  (  dH\ 

dqi      dpi      dqi  \  dpi  )      dpi  \    dqi  )  ' 

which  is  indeed  zero.  Similarly,  all  the  other  terms  in  (12.27)  cancel  in 
pairs.  Thus  the  theorem  is  proved. 

In  general,  except  for  special  cases,  the  motion  in  phase  space  is  com- 
plicated. The  phase-space  volume  containing  the  swarm  of  representative 
points  maintains  its  volume,  but  becomes  extremely  distorted,  rather  like  a 
drop  of  immiscible  coloured  liquid  in  a  glass  of  water  which  is  then  stirred. 
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The  points  cannot  just  go  anywhere  in  phase  space,  because  of  the  en- 
ergy conservation  equation.  They  must  remain  on  the  same  constant-energy 
surface,  H(q,p)  =  E,  and  of  course  there  might  be  other  conservation  laws 
that  also  restrict  the  accessible  region  of  phase  space.  But  one  might  ex- 
pect that  in  time  the  phase-space  volume  would  become  thinly  distributed 
throughout  almost  all  the  accessible  parts  of  phase  space.  Such  behaviour 
is  called  ergodic,  and  is  commonly  assumed  in  statistical  mechanics.  Aver- 
aged properties  of  the  system  over  a  long  time  can  then  be  estimated  by 
averaging  over  the  accessible  phase  space. 

Remarkably,  however,  many  quite  complicated  systems  do  not  behave 
in  this  way,  but  show  surprising  almost-periodic  behaviour,  to  be  discussed 
in  Chapter  14.  The  study  of  which  systems  do  and  do  not  behave  ergodi- 
cally,  and  particularly  the  transition  between  one  type  and  another  as  the 
parameters  of  the  system  are  varied,  is  now  one  of  the  most  active  fields  of 
mathematical  physics.  It  has  revealed  an  astonishing  range  of  possibilities. 

12.6    Symmetries  and  Conservation  Laws 

In  §§12.2  and  12.3,  we  found  some  examples  of  conserved  quantities,  but  so 
far  we  have  not  discussed  the  physical  reasons  for  their  existence.  In  fact, 
they  are  expressions  of  symmetry  properties  possessed  by  the  system. 

For  example,  the  conservation  law  of  angular  momentum  for  the  cen- 
tral force  problem,  (12.11),  arises  from  the  fact  that  the  Hamiltonian  is 
independent  of  9.  This  is  an  expression  of  the  rotational  symmetry  of  the 
system  —  in  other  words,  of  the  fact  that  there  is  no  preferred  orientation 
in  the  plane.  Explicitly,  the  equation  dH/d8  =  0  means  that  the  energy  of 
the  system  is  unchanged  if  we  rotate  it  to  a  new  position,  replacing  9  by 
9  +  56,  without  changing  r,pr  or  pg.  Tims  angular  momentum  is  conserved 
(pg  =  constant)  for  systems  possessing  this  rotational  symmetry.  Of  course, 
if  the  force  is  non-central,  it  does  determine  a  preferred  orientation  is  space, 
and  angular  momentum  is  not  conserved. 

In  a  three-dimensional  problem,  every  component  of  the  angular  mo- 
mentum J  is  conserved  if  the  force  is  purely  central.  If  the  force  is  non- 
central,  but  still  possesses  axial  symmetry  —  so  that  H  depends  on  9  but 
not  on  tp  —  then  only  the  component  of  J  along  the  axis  of  symmetry, 
namely  pv,  is  conserved. 

Similarly,  for  the  symmetric  top,  the  equation  dH/dtp  =  0  is  an  ex- 
pression of  the  rotational  symmetry  of  the  system  about  the  vertical.  The 
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corresponding  conserved  quantity  pv  is  the  vertical  component  of  J;  for, 
by  (9.43)  and  (9.45), 

Jz  =  k  ■  J  =  Ticpsin2  6  +  Iz(i{>  +  <pcos#)  cos#  =  pv. 

The  equation  dH/dtp  =  0  expresses  the  rotational  symmetry  of  the  top  itself 
about  its  own  axis.  The  energy  is  clearly  unchanged  by  rotating  the  top 
about  its  axis.  In  this  case,  we  see  from  (9.45)  that  the  conserved  quantity 
p^,  is  the  component  of  J  along  the  axis  of  the  top,  p^  =  Jz  =  ez  ■  J. 

Now  of  course  not  all  symmetries  are  expressible  simply  by  saying  that 
H  is  independent  of  some  particular  co-ordinate.  For  example,  we  might 
consider  the  central  force  problem  in  terms  of  the  Cartesian  co-ordinates  x 
and  y.  Then  the  Hamiltonian  is 

Pi  +P2y 


Since  it  depends  on  both  x  and  y,  neither  co-ordinate  is  ignorable.  It  does, 
however,  possess  a  symmetry  under  rotations.  If  we  make  a  small  rotation 
through  an  angle  59,  the  changes  in  the  co-ordinates  and  momenta  are  (see 
Fig.  A.4) 


Sx  =  —y59,  Sy  =  x50, 

5px  =  -py  59,        5pv  =  px  59. 


(12.28) 


Under  this  transformation,  5(x2  +  y2)  =  0  and  5(p2x  +  p2)  =  0,  so  clearly 
5H  =  0. 

Now  we  know  from  our  earlier  discussion  in  terms  of  polar  co-ordinates 
that  this  symmetry  is  related  to  the  conservation  of  angular  momentum, 

J  =  xpy  —  ypx  =  constant. 

The  problem  is  to  understand  the  relationship  between  the  transformation 
(12.28)  and  the  conserved  quantity  J. 

Let  us  consider  a  general  function  of  the  co-ordinates,  momenta,  and 
time,  G(q,p,t).  We  define  the  transformation  generated  by  G  to  be 

5qa  =  —  5X,       5Pa  =  -—  5X,  (12.29) 
dpa  dqa 

where  5X  is  an  infinitesimal  parameter.  For  example,  the  function  G  =  p\ 
generates  the  transformation  in  which  5q\  =  5A,  while  all  the  remaining  co- 
ordinates and  momenta  are  unchanged.  Using  Hamilton's  equations  (12.6) 
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and  (12.7),  we  see  that  the  transformation  generated  by  the  Hamiltonian  is 

3<la  =  Qa  SX,  Spa  =  pa  6\.  (12.30) 

If  SX  is  interpreted  as  a  small  time  interval,  this  represents  the  time  devel- 
opment of  the  system. 

We  can  now  return  to  the  function  J.  The  transformation  it  generates 
is  given  by 

Sx  =  — — SX  =  —ySX,  Sy  =  — — SX  =xSX, 

dpx  dpy 

SPX    =    -  TT-  SX    =    -Py  SX,  Spy    =    -  ^  SX    =  PX  SX. 

ax  ay 

This  is  clearly  identical  to  the  infinitesimal  rotation  (12.28).  Thus  we  have 
established  a  connection  between  J  and  this  transformation  (12.28). 

The  next  problem  is  to  understand  why  the  fact  that  this  transformation 
represents  a  symmetry  property  of  the  system  should  lead  to  a  conservation 
law.  To  this  end,  we  return  to  a  general  function  G,  and  consider  the  effect 
of  the  transformation  (12.29)  on  some  other  function  F{q,p,  t).  The  change 
in  F  is 

This  kind  of  sum,  involving  the  derivatives  of  two  functions,  appears  quite 
frequently,  and  it  is  convenient  to  introduce  an  abbreviated  notation.  We 
define  the  Poisson  bracket  of  F  and  G  to  be 

][F,G^     ^[(dqadpa     dpadqa)'  (12.31) 

Then  we  can  write  the  change  in  F  under  the  transformation  generated  by 
G  in  the  form 

SF  =  [F,  G]  SX.  (12.32) 

A  particular  example  is  provided  by  the  transformation  (12.30)  gener- 
ated by  H.  The  rate  of  change  of  F  is 

dF     8F     ^  /  dF  .       dF  .  \      dF     r^  T„ 

dF  =  7*  +  g  \WJa  +  dp-/a)=lH+  [F' Hl  (12'33) 
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The  extra  term  here  arises  from  the  fact  that  we  have  now  allowed  F  to  have 
an  explicit  dependence  on  the  parameter  t,  in  addition  to  the  dependence 
via  q  and  p. 

Now  an  obvious  property  of  the  Poisson  bracket  is  its  antisymmetry.  If 
we  interchange  F  and  G,  we  merely  change  the  sign: 

[G,F]=-[F,G].  (12.34) 

This  has  the  important  consequence  that,  if  F  is  unchanged  by  the  trans- 
formation generated  by  G,  then  reciprocally  G  is  unchanged  by  the  trans- 
formation generated  by  F. 

We  are  now  finally  in  a  position  to  apply  this  discussion  to  the  case  of  a 
symmetry  property  of  the  system.  Let  us  suppose  that  there  exists  a  trans- 
formation of  the  co-ordinates  and  momenta  which  leaves  the  Hamiltonian 
unaffected,  and  which  is  generated  by  a  function  G.  From  (12.29)  we  see 
that  the  generator  G  is  unique,  apart  from  an  arbitrary  additive  function 
of  t,  independent  of  q  and  p.  In  particular,  if  the  transformation  does  not 
involve  the  time  explicitly,  then  G  may  be  chosen  to  contain  no  explicit  t 
dependence.  The  condition  that  H  should  be  unchanged  is 

SH  =[H,G}5\  =  0.  (12.35) 

It  then  follows  from  the  reciprocity  relation  (12.34)  that  [G,  H\  =  0  also. 
Hence  if  dG/dt  =  0,  we  find  from  (12.33)  that 

^  =  [G,H}=0.  (12.36) 

Thus  we  have  shown  that  if  H  is  unaltered  by  a  i-independent  transforma- 
tion of  this  type,  then  the  corresponding  generator  is  conserved. 

The  number  of  independent  symmetries  possessed  by  a  system,  and 
hence  the  number  of  conserved  quantities,  has  a  profound  effect  on  the 
way  that  the  system  may  behave.  By  exploiting  the  conservation  laws,  the 
complexity  of  problems  may  be  reduced  progressively.  If  the  number  of 
independent  conserved  quantities  for  a  system,  i.e.,  constants  or  integrals 
of  the  motion,  is  at  least  equal  to  the  number  of  degrees  of  freedom  (the 
number  of  independent  co-ordinates),  then  the  reduction  may  be  complete, 
and  the  system  is  termed  integrable  (in  the  sense  of  Liouville  —  see  §14.1). 
The  motion  is  then  ordered  and  'regular'.  If  there  are  insufficient  conserved 
quantities  to  bring  this  about,  then  the  motion  may  exhibit  disorder  or 
'chaos'. 
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The  central,  conservative  force  problems  considered  in  Chapter  4,  and 
the  symmetric  top  considered  in  §10.3  and  §§12.3,  12.4,  are  examples  of 
integrable  systems,  since  they  possess  respectively  two  and  three  conserved 
quantities,  equal  in  each  case  to  the  number  of  degrees  of  freedom.  The 
restricted  three-body  problem,  considered  in  Problems  15  and  16  of  Chapter 
10,  and  in  Problems  12  and  13  at  the  end  of  this  chapter,  does  not  possess 
such  a  complete  set  of  conserved  quantities,  and  so  is  not  integrable. 

The  formal  description  of  the  connection  between  symmetry  properties 
and  invariance  is  contained  in  a  famous  theorem  due  to  Emmy  Noether 
(1918). 

12.7    Galilean  Transformations 

To  illustrate  the  ideas  of  the  preceding  section,  we  shall  consider  a  general, 
isolated  system  of  N  particles,  and  investigate  the  symmetry  properties 
implied  by  the  relativity  principle  of  §1.1. 

The  system  has  3N  degrees  of  freedom,  and  may  be  described  by  the  par- 
ticle positions  r»  and  momenta  pi  (i  =  1,2,...,  N).  We  shall  consider  four 
distinct  symmetry  properties,  associated  with  the  requirements  that  there 
should  be  no  preferred  zero  of  the  time  scale,  origin  in  space,  orientation 
of  axes,  or  standard  of  rest.  The  corresponding  symmetry  transformations 
are  translations  in  time,  spatial  translations,  rotations,  and  transformations 
between  frames  moving  with  uniform  relative  velocity  (sometimes  called 
boosts).  A  combination  of  these  four  types  of  transformation  is  the  most 
general  transformation  which  takes  one  inertial  frame  into  another.  They 
are  known  collectively  as  Galilean  transformations.  We  consider  them  in 
turn. 

Time  translations 

The  changes  in  r  and  p  in  an  infinitesimal  time  St  are  generated  by  the 
Hamiltonian  function  H.  The  condition  for  invariance  of  H  under  this 
transformation  is  [H,H]  =  0,  which  is  certainly  true  because  of  (12.34). 
Thus,  as  we  showed  in  §12.2,  if  H  contains  no  explicit  time  dependence, 
then  it  is  in  fact  conserved, 


(12.37) 
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Spatial  translations 

An  infinitesimal  translation  of  the  system  through  a  distance  Sx  in  the  x 
direction  is  represented  by  the  transformation 

Sx%  =  Sx,        Syi  =  0,  Sz,  =  0,  ^ 

Spxl  =  0,        Spyl  =  0,        Spzl  =  0. 

The  corresponding  generator  is  easily  seen  to  be  the  total  ^-component  of 
momentum. 


Px  = 


N 

try  . 
XI- 


»  =  1 

The  condition  for  if  to  be  invariant  under  this  transformation  is 

N 

O.v, 


0=[H,Px]5x  =  ^2  —  5x. 

i=l 


It  is  satisfied  if  H  depends  only  on  the  co-ordinate  differences  Xi  —  xy, 
for  then,  changing  each  x{  by  the  same  amount  cannot  affect  H.  When 
this  condition  holds,  we  obtain  a  conservation  law  for  the  ^-component  of 
momentum,  dPx/dt  =  0. 

More  generally,  a  translation  in  the  direction  of  the  unit  vector  n  is 
generated  by  the  component  of  the  total  momentum  P  in  this  direction, 
n  ■  P.  When  the  system  possesses  translational  invariance  in  all  directions, 
then  all  components  of  P  are  conserved: 

This  is  physically  very  reasonable.  We  know  from  our  earlier  work  that 
momentum  is  conserved  for  an  isolated  system,  but  not  for  a  system  sub- 
jected to  external  forces  which  determine  a  preferred  origin  (for  example, 
a  centre  of  force). 

Rotations 

An  infinitesimal  rotation  through  an  angle  Sip  about  the  2-axis  yields 

Sxi  =  -yl5(fi,  5yi=XiS<f,  Szi=  0,  (12  40) 

5pxi  =  -pVi  S(fi,        8pyi  =  pxi  Sip,        Spzi  =  0. 
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The  corresponding  generator  is  the  z-component  of  the  total  angular  mo- 
mentum, 

N 

Jz  —  ^  ^(%iPyi  ~  ViPxi)- 
i=l 

The  condition  for  H  to  be  rotationally  symmetric,  [H,  Jz]  =  0,  is  satis- 
fied provided  that  H  involves  the  x  and  y  co-ordinates  and  momenta  only 
through  invariant  combinations  like  x%Xj  +  yiUj. 

In  general,  a  rotation  through  an  angle  dip  about  an  axis  in  the  direction 
of  the  unit  vector  n  may  be  written  in  the  form 

5ri  =  nAri5ip,        5pi  =  n  Apj^Sip.  (12.41) 

It  is  generated  by  the  appropriate  component  of  the  angular  momentum, 
n  ■  J. 

The  Hamiltonian  function  is  invariant  under  this  transformation  pro- 
vided that  it  is  a  scalar  function  of  r{  and  pi,  involving  only  the  squares 
and  scalar  products  of  vectors.  It  is  easy  to  verify  that  scalar  quantities  are 
indeed  unchanged  by  (12.41).  For  example, 

5(ri  ■  rj)  =  {Sr,)  ■  rj  +  r»  •  (Srj)  =  [(n  A  rz)  ■  rd  +  rt  ■  (n  A  rj)]Sip  =  0, 

by  the  symmetry  of  the  scalar  triple  product  (see  (A. 13)). 

When  the  Hamiltonian  possesses  complete  rotational  symmetry,  then 
every  component  of  J  is  conserved,  and  so 


So  far,  we  have  shown  that  the  conservations  laws  of  energy,  momentum, 
and  angular  momentum  are  expressions  of  symmetry  properties  required  by 
the  relativity  principle.  They  have,  therefore,  a  much  more  general  validity 
than  the  specific  assumptions  used  in  their  original  derivation.  For  example, 
we  have  not  assumed  that  the  forces  in  our  system  are  all  two-body  forces,  or 
that  they  are  central  or  conservative.  All  we  have  assumed  is  the  existence 
of  the  Hamiltonian  function,  and  the  relativity  principle. 

There  remains  one  type  of  Galilean  transformation,  which  is  in  some 
ways  rather  different  from  the  others. 


(12.42) 
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Transformations  to  moving  frames 

Let  us  consider  the  effect  of  giving  our  system  a  small  overall  velocity  5v  in 
the  x  direction  —  a  small  'boost'.  The  corresponding  transformations  are 

Sxj  =  tdv,  5yi  =  0,  Szi  =  0, 

y  '  (12.43) 

Spxi  =  rrii  6v,        5pyi  =  0,        5pzi  =  0. 

This  transformation  differs  from  the  others  we  have  considered  in  that  t 
appears  explicitly  in  (12.43).  The  generator  of  this  transformation  is 


Gx  =  J^iP^  ~  mixi)  =  P^  ~  MX' 


where  M  is  the  total  mass,  and  X  is  the  x  co-ordinate  of  the  centre  of 
mass.  This  generator  is  also  explicitly  time-dependent.  It  is  clearly  the  x 
component  of  the  vector 

G  =  Pt-  MR.  (12.44) 

Transformations  to  frames  moving  in  other  directions  are  generated  by  ap- 
propriate components  of  this  vector. 

We  must  now  be  careful.  For,  because  of  the  explicit  time-dependence, 
it  is  no  longer  true  that  H  must  be  invariant  under  (12.43).  Indeed,  we  know 
that  the  energy  of  a  system  does  depend  on  the  choice  of  reference  frame, 
though  not  on  the  choice  of  origin  or  axes.  What  the  relativity  principle 
actually  requires  is  that  the  equations  of  motion  should  be  unchanged  by 
the  transformation.  It  can  be  shown  (see  Problem  17)  that  the  condition 
for  this  is  still 

f  =  0.  (1,45) 

However,  this  no  longer  implies  the  invariance  of  H.  In  fact,  using  (12.33) 
and  (12.34),  we  find 

BC 

5H  =[H,G}5\=  -  [G,H}5\=  —5\.  (12.46) 

Thus  the  change  in  H  is  related  to  the  explicit  time-dependence  of  G. 
The  fourth  of  the  basic  conservation  laws,  for  the  quantity  (12.44),  is 

f  =  A(Pt-Mtf)  =  0.  (12.47) 
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Though  this  is  an  unfamiliar  form,  the  equation  is  actually  quite  familiar. 
For,  since  dP /dt  =  0,  it  may  be  written 

P-M^=0, 

dt 

which  is  simply  the  relation  (8.7)  between  the  total  momentum  and  the 
centre-of-mass  velocity.  This  relation  too  is  therefore  a  consequence  of  the 
relativity  principle. 

Let  us  now  turn  to  the  question  of  what  this  symmetry  implies  for  the 
form  of  the  Hamiltonian  function.  For  a  transformation  in  the  x  direction, 
the  change  in  H,  given  by  (12.46),  is 

8C 

SH  =        Sv  =  Px  Sv.  (12.48) 
at 

Now,  if  we  write 

N  2 

H  =  T  +  V,        where       T  =  V  p-, 

2m; 

i=i 

then  we  find  directly  from  (12.43)  that 

7V 

ST  =  ^pxi  Sv  =  Px  Sv. 
t=i 

Thus  the  change  in  kinetic  energy  is  exactly  what  is  demanded  by  (12.48), 
and  this  relation  reduces  to 


SV  =  0.  (12.49) 

It  is  interesting  to  examine  the  conditions  imposed  on  V  by  this  require- 
ment. We  have  already  seen  that  V  must  be  a  scalar  function,  and  must 
involve  the  particle  positions  only  through  the  differences  =  —  Tj. 
Since  these  are  unaffected  by  (12.43),  the  requirement  (12.49)  imposes  no 
further  restrictions  on  the  r  dependence.  Moreover,  V  must  contain  no 
explicit  dependence  on  time.  However,  none  of  the  conditions  imposed  so 
far  requires  it  to  be  independent  of  the  momenta.  To  satisfy  (12.49),  it 
must  contain  them  only  through  the  combinations 
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which  are  easily  seen  to  be  invariant  under  (12.43),  but  this  is  the  only 
new  requirement.  Thus  the  most  general  form  of  interaction  in  an  N- 
particle  system  which  is  invariant  under  all  Galilean  transformations  is 
one  described  by  a  'potential  energy  function'  which  is  an  arbitrary  scalar 
function  of  the  relative  position  vectors  and  the  relative  velocity  vectors. 

Reflections;  parity 

There  is  one  other  type  of  co-ordinate  transformation  which  might  be  men- 
tioned. All  those  we  have  considered  so  far  have  the  property  that  if  we 
start  with  a  right-handed  set  of  axes,  then  the  transformation  takes  us 
to  another  right-handed  set.  However,  we  could  also  consider  transforma- 
tions like  reflections  (say  x  — >  —x,y  — >  y,z  — >  z)  or  inversions  (r  — >  —  r) 
which  lead  from  a  right-handed  set  of  axes  to  a  left-handed  set.  These  are 
called  improper  co-ordinate  transformations.  They  differ  from  the  proper 
transformations,  such  as  rotations,  in  being  discrete  rather  than  continu- 
ous —  no  continuous  change  can  ever  take  a  right-handed  set  of  axes  into 
a  left-handed  set. 

The  condition  for  the  Hamiltonian  to  be  unchanged  also  under  improper 
co-ordinate  transformations  is  that  it  should  be  a  true  scalar  function,  like 
Vi  ■  rj,  rather  than  a  pseudoscalar,  like  (r«  A  r,-)  •  Vk,  which  changes  sign 
under  inversion.  If  this  condition  is  fulfilled,  the  equations  of  motion  will 
have  the  same  form  in  right-handed  and  left-handed  frames  of  reference. 

Because  of  the  discontinuous  nature  of  these  transformations,  this  sym- 
metry does  not  lead  to  a  conservation  law  for  some  continuous  variable.  In 
fact,  in  classical  mechanics,  it  does  not  lead  to  a  conservation  law  at  all. 
However,  in  quantum  mechanics,  it  yields  a  conservation  law  for  a  quantity 
known  as  the  parity,  which  has  only  two  possible  values  ±1.  Until  1957, 
it  was  believed  that  all  physical  laws  were  unchanged  by  reflections,  but  it 
was  then  discovered  by  Wu  et  al,  following  a  theoretical  prediction  by  Lee 
and  Yang,  that  parity  is  in  fact  not  conserved  in  the  process  of  radioactive 
decay  of  atomic  nuclei.  The  laws  describing  such  processes  do  not  have  the 
same  form  in  right-handed  and  left-handed  frames  of  reference. 

12.8  Summary 

The  Hamiltonian  method  is  an  extremely  powerful  tool  in  dealing  with  com- 
plex problems.  In  particular,  when  the  Hamiltonian  function  is  independent 


Hamiltonian  Mechanics 


301 


of  some  particular  co-ordinate  qa,  then  the  corresponding  generalized  mo- 
mentum pa  is  conserved.  In  such  a  case,  the  number  of  degrees  of  freedom 
is  effectively  reduced  by  one. 

More  generally,  we  have  seen  that  any  symmetry  property  of  the  system 
leads  to  a  corresponding  conservation  law.  This  can  be  of  great  importance 
in  practice,  since  the  amount  of  labour  involved  in  solving  a  complicated 
problem  can  be  greatly  reduced  by  making  full  use  of  all  the  available 
symmetries.  If  there  is  a  sufficient  number  of  symmetries,  then  the  system 
is  'integrable'  (in  the  sense  of  Liouville)  and  the  conservation  laws  may  then 
be  exploited  to  produce  (in  principle)  the  complete  solution  to  the  problem. 

The  Hamiltonian  function  is  also  of  great  importance  in  quantum  me- 
chanics, and  many  of  the  features  of  our  discussion  carry  over  to  that  case. 
We  have  seen  that  the  variables  appear  in  pairs.  To  each  co-ordinate  qa 
there  corresponds  a  momentum  pa.  Such  pairs  are  called  canonically  con- 
jugate. This  relationship  between  pairs  of  variables  is  of  central  importance 
in  quantum  mechanics,  where  there  is  an  'uncertainty  principle'  according 
to  which  it  is  impossible  to  measure  both  members  of  such  a  pair  simulta- 
neously with  arbitrary  accuracy. 

The  relationship  between  symmetries  and  conservation  laws  also  applies 
to  quantum  mechanics.  In  relativity,  the  transformations  we  consider  are 
slightly  different  (Lorentz  transformations  rather  than  Galilean),  but  the 
same  principles  apply,  and  lead  to  very  similar  conservation  laws. 

The  relationship  between  the  relativity  principle  and  the  familiar  con- 
servation laws  (including  the  'conservation  law'  P  =  MR)  is  of  the  greatest 
importance  for  the  whole  of  physics.  It  is  the  basic  reason  for  the  univer- 
sal character  of  these  laws,  which  were  originally  derived  as  rather  special 
consequences  of  Newton's  laws,  but  can  now  be  seen  as  having  a  far  more 
fundamental  role. 


Problems 

1.  A  particle  of  mass  m  slides  on  the  inside  of  a  smooth  cone  of  semi- 
vertical  angle  a,  whose  axis  points  vertically  upwards.  Obtain  the 
Hamiltonian  function,  using  the  distance  r  from  the  vertex,  and  the 
azimuth  angle  ip  as  generalized  co-ordinates.  Show  that  stable  circular 
motion  is  possible  for  any  value  of  r,  and  determine  the  correspond- 
ing angular  velocity,  ui.  Find  the  angle  a  if  the  frequency  of  small 
oscillations  about  this  circular  motion  is  also  u>. 
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2.  Find  the  Hamiltonian  function  for  the  forced  pendulum  considered  in 
§10.4,  and  verify  that  it  is  equal  to  T'  +  V.  Determine  the  frequency  of 
small  oscillations  about  the  stable  'equilibrium'  position  when  uj2  >  g/l. 

3.  A  light,  inextensible  string  passes  over  a  small  pulley  and  carries  a 
mass  2m  on  one  end.  On  the  other  end  is  a  mass  m,  and  beneath 
it,  supported  by  a  spring  with  spring  constant  k,  a  second  mass  m. 
Find  the  Hamiltonian  function,  using  the  distance  x  of  the  first  mass 
beneath  the  pulley,  and  the  extension  y  in  the  spring,  as  generalized  co- 
ordinates. Show  that  x  is  ignorable.  To  what  symmetry  property  does 
this  correspond?  (In  other  words,  what  operation  can  be  performed  on 
the  system  without  changing  its  energy?)  If  the  system  is  released  from 
rest  with  the  spring  unextended,  find  the  positions  of  the  particles  at 
any  later  time. 

4.  A  particle  of  mass  m  moves  in  three  dimensions  under  the  action  of 
a  central,  conservative  force  with  potential  energy  V(r).  Find  the 
Hamiltonian  function  in  terms  of  spherical  polar  co-ordinates,  and 
show  that  <p,  but  not  6,  is  ignorable.  Express  the  quantity  J2  = 
m2r4(82  +  sin2  8  ip2)  in  terms  of  the  generalized  momenta,  and  show 
that  it  is  a  second  constant  of  the  motion. 

5.  *Find  the  Hamiltonian  for  the  pendulum  hanging  from  a  trolley  de- 
scribed in  Chapter  10,  Problem  9.  Show  that  x  is  ignorable.  To  what 
symmetry  does  this  correspond? 

6.  *  Obtain  the  Hamiltonian  function  for  the  top  with  freely  sliding  pivot 
described  in  Chapter  10,  Problem  11.  Find  whether  the  minimum 
angular  velocity  required  for  stable  vertical  rotation  is  greater  or  less 
than  in  the  case  of  a  fixed  pivot.  Can  you  explain  this  result  physically? 

7.  *To  prove  that  the  effective  potential  energy  function  U(8)  of  the  sym- 
metric top  (see  §12.4)  has  only  a  single  minimum,  show  that  the  equa- 
tion U{8)  =  E  can  be  written  as  a  cubic  equation  in  the  variable 
2  =  cos  8,  with  three  roots  in  general.  Show,  however,  that  f(z)  has  the 
same  sign  at  both  z  =  ±1,  and  hence  that  there  are  either  two  roots  or 
none  between  these  points:  for  every  E  there  are  at  most  two  values  of 
8  for  which  U(8)  =  E. 

8.  Find  the  Hamiltonian  for  a  charged  particle  in  electric  and  mag- 
netic fields  in  cylindrical  polars,  starting  from  the  Lagrangian  function 
(10.29).  Show  that  in  the  case  of  an  axially  symmetric,  static  mag- 
netic field,  described  by  the  single  component  Av(p,z)  of  the  vector 
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potential,  it  takes  the  form 


) 


(Note:  Remember  that  the  subscripts  ip  on  the  generalized  momentum 
pv  and  on  the  component  Av  mean  different  things.) 
9.  A  particle  of  mass  m  and  charge  q  is  moving  around  a  fixed  point 
charge  —q'  (qq'  >  0),  and  in  a  uniform  magnetic  field  B.  The  motion 
is  confined  to  the  plane  perpendicular  to  B.  Write  down  the  Lagrangian 
function  in  polar  co-ordinates  rotating  with  the  Larmor  angular  velocity 
ll>l  =  —qB /2m  (see  §5.5).  Hence  find  the  Hamiltonian  function.  Show 
that  ip  is  ignorable,  and  interpret  the  conservation  law.  (Note  that  J z 
is  not  a  constant  of  the  motion.) 

10.  Consider  a  system  like  that  of  Problem  9,  but  with  a  charge  +q'  at 
the  origin.  By  examining  the  effective  radial  potential  energy  function, 
find  the  radius  of  a  stable  circular  orbit  with  angular  velocity  wl,  and 
determine  the  angular  frequency  of  small  oscillations  about  it. 

11.  *A  particle  of  mass  m  and  charge  q  is  moving  in  the  equatorial  plane 
2  =  0  of  a  magnetic  dipole  of  moment  /i,  described  (see  Appendix  A, 
Problem  12)  by  a  vector  potential  with  the  single  non-zero  component 
A<p  =  A*oMsm$/4T2 •  Show  that  it  will  continue  to  move  in  this  plane. 
Initially,  it  is  approaching  from  a  great  distance  with  velocity  v  and 
impact  parameter  b,  whose  sign  is  defined  to  be  that  of  pv.  Show  that 
v  and  pv  are  constants  of  the  motion,  and  that  the  distance  of  closest 
approach  to  the  dipole  is  ^(Vb2  =F  a2  ±  b),  according  as  b  >  a  or  b  <  a, 
where  a2  =  pLoqn / irmv .  (Here  qfi  is  assumed  positive.)  Find  also  the 
range  of  values  of  b  for  which  the  velocity  can  become  purely  radial,  and 
the  distances  at  which  it  does  so.  Describe  qualitatively  the  appearance 
of  the  orbits  for  different  values  of  b.  (Hint:  It  may  be  useful  to  sketch 
the  effective  radial  potential  energy  function.) 

12.  *Find  the  Hamiltonian  for  the  restricted  three-body  problem  described 
in  Chapter  10,  Problems  15  and  16.  Investigate  the  stability  of  one  of 
the  Lagrangian  'equilibrium'  positions  off  the  line  of  centres  by  assum- 
ing a  solution  where  x  —  xq,  y  —  yo,px  +  mtoyo  and  py  —  rmoxQ  are  all 
small  quantities  proportional  to  ept,  with  p  constant.  Show  that  the 
possible  values  for  p  are  given  by 


p'+ujV  + 


27MiM2w4 


4(Mi  +  M2) 
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and  hence  that  the  points  are  stable  provided  that  the  masses  M\  and 
M2  are  sufficiently  different.  Specifically,  given  that  Mi  >  M2  show 
that  the  minimum  possible  ratio  for  stability  is  slightly  less  than  25. 

13.  The  stability  condition  of  Problem  12  is  well  satisfied  for  the  case  of 
the  Sun  and  Jupiter,  for  which  M\jMi  =  1047.  Indeed,  in  that  case 
these  positions  are  occupied  by  the  so-called  Trojan  asteroids,  whose 
orbital  periods  are  the  same  as  Jupiter's,  11.86  years.  Find  for  this  case 
the  periods  of  small  oscillations  about  the  'equilibrium'  points  (in  the 
plane  of  the  orbit). 

14.  *The  magnetic  field  in  a  particle  accelerator  is  axially  symmetric  (as  in 
Problem  8),  and  in  the  plane  z  =  0  has  only  a  2  component.  Defining 
J  =  pv  —  qpA^,  show,  using  (A. 40)  and  (A. 55),  that  dJ/dp  =  —qpBz, 
and  dJ/dz  =  qpBp.  What  is  the  relation  between  ip  and  J?  Treat  the 
third  term  of  the  Hamiltonian  in  Problem  8  as  an  effective  potential 
energy  function  U(p,z)  =  J2/2mp2,  compute  its  derivatives,  and  write 
down  the  'equilibrium'  conditions  dU/dp  =  dU/dz  =  0.  Hence  show 
that  a  particle  of  mass  m  and  charge  q  can  move  in  a  circle  of  any  given 
radius  a  in  the  plane  2  =  0  with  angular  velocity  equal  to  the  cyclotron 
frequency  for  the  field  at  that  radius  (see  §5.2). 

15.  *To  investigate  the  stability  of  the  motion  described  in  the  preceding 
question,  evaluate  the  second  derivatives  oft/  at  p  =  a,  z  =  0,  and 
show  that  they  may  be  written 


(Hint:  You  will  need  to  use  the  ip  component  of  the  equation  VAB  =  0, 
and  the  fact  that,  since  Bp  =  0  for  all  p,  dBp/dp  =  0  also.)  Given  that 
the  dependence  of  Bz  on  p  near  the  equilibrium  orbit  is  described  by 
Bz  oc  (a/ p)n,  show  that  the  orbit  is  stable  if  0  <  n  <  1. 
16.  Show  that  the  Poisson  brackets  of  the  components  of  angular  momen- 
tum are 


p=a,z=0 


d2u 

dpdz 


=  0, 


d2U 
dz2 


p=a,z=0 


(together  with  two  other  relations  obtained  by  cyclic  permutation  of 
x,y,z).  Interpret  this  result  in  terms  of  the  transformation  of  one 
component  generated  by  another. 
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17.  *Show  that  the  condition  that  Hamilton's  equations  remain  unchanged 
under  the  transformation  generated  by  G  is  dG/dt  =  0  even  in  the 
case  when  G  has  an  explicit  time-dependence,  in  addition  to  its  depen- 
dence via  q(t)  and  p(t).  Proceed  as  follows.  The  first  set  of  Hamilton's 
equations,  (12.6),  will  be  unchanged  provided  that 


Write  both  sides  of  this  equation  in  terms  of  G  and  use  (12.33)  applied 
both  to  dG/dpa  and  to  G  itself  to  show  that  it  is  equivalent  to  the 
condition 


Thus  dG/dt  is  independent  of  each  pa.  Similarly,  by  using  the  other 
set  of  Hamilton's  equations,  (12.7),  show  that  it  is  independent  of  each 
qa.  Thus  dG/dt  must  be  a  function  of  t  alone.  But  since  we  can  always 
add  to  G  any  function  of  t  alone  without  affecting  the  transformation 
it  generates,  this  means  we  can  choose  it  so  that  dG/dt  =  0. 
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Dynamical  Systems  and  Their 
Geometry 

In  this  chapter  we  will  begin  to  look  at  continuous  dynamical  problems  in 
a  new,  powerful  and  largely  geometrical  way,  which  had  its  origins  in  the 
work  of  Poincare. 

This  approach  enables  us  to  move  on  from  the  solution  of  the  differential 
equations  for  a  system  having  one  set  of  parameters  and  initial  conditions 
to  the  consideration  of  the  geometry  of  all  the  possible  solutions.  Not 
only  does  this  new  perspective  bring  out  important  features  of  systems 
which  are  already  familiar,  but  it  also  enables  us  to  gain  qualitative  (and 
some  quantitative)  information  about  systems  in  which  there  is  complex 
interaction,  which  may  be  highly  nonlinear  and  accessible  by  other  methods 
only  with  difficulty,  if  at  all. 

The  ideas  are  of  very  wide  applicability  in  many  fields,  not  only  in 
mechanics,  other  branches  of  physics,  engineering  and  applied  mathematics, 
but  also  in,  for  example,  biology,  chemistry  and  economics. 

[For  some  systems  it  is  appropriate  and  useful  to  observe  at  discrete  time 
intervals  (not  necessarily  equal).  Differential  equations  are  then  replaced 
by  maps  —  see  §14.2  and  appendix  D.] 

13.1    Phase  Space  and  Phase  Portraits 

Many  systems  have  a  finite  number  n  of  distinct  dynamic  elements,  which 
may  be  represented  by  functions  Xi(t)  of  time  t  where  i  =  l,2,...,n.  It 
should  be  noted  that  n  is  not  necessarily  the  same  as  the  number  of  degrees 
of  freedom  if,  for  example,  the  system  is  mechanical.  Here  we  are  dealing 
with  the  number  of  elements  which  describe  the  dynamic  state  and  these 
might  therefore  include,  for  example,  positions  and  velocities  or  momenta 
(as  in  §12.1). 
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We  may  denote  the  state  of  the  system  by  the  vector  x(t)  and  the  n 
equations  of  motion  can  be  written  in  vector  form 


describing  the  evolution  of  the  dynamic  elements  with  'velocity'  F. 

In  practice  F  usually  contains  a  set  c  of  control  parameters  which  are 
characteristic  of  the  physical  problem,  like  masses,  spring  constants  and  so 
on. 

The  idea  is  that  we  fix  c  and  consider  the  solutions  of  (13.1)  for  var- 
ious different  prescribed  values  of  the  Xi  at  a  particular  initial  time,  so 
determining  the  evolution  of  these  different  initial  states. 

It  is  extremely  useful  to  view  (13.1)  as  the  prescription  for  the  way  in 
which  the  point  x  evolves  with  time  along  trajectories  (or  orbits)  in  an 
n-dimensional  space  —  the  phase  space  of  the  system.  The  function  F  in 
(13.1)  is  then  called  the  phase  velocity.  The  set  of  all  trajectories  in  this 
space  for  various  initial  prescribed  values  of  x,  together  with  the  velocity 
directions  upon  them,  constitutes  the  phase  portrait  of  the  system.  The 
evolutions  of  a  continuous  dynamical  system  pictured  in  this  manner  are 
often  described  as  a  flow,  by  analogy  with  fluid  motion  (see  Fig.  13.1). 

The  phase  space  is  usually  taken  to  be  Euclidean  in  its  geometry.  While 
this  is  not  absolutely  necessary,  it  is  useful  to  do  so,  although  there  are 
obvious  difficulties  in  graphical  representation  when  n  >  3. 


x  = 


(13.1) 


Fig.  13.1 
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It  is  important  to  note  that  the  first-order  system  (13.1)  is  not  as  special 
as  it  appears,  since  any  system  having  higher-order  derivatives  can  be  put 
into  this  standard  form  by  a  suitable  change  of  variables.  If  we  have 


then  we  may  write  Xi  =  dl~1x/dtl~1  for  i  =  1,2, ...  ,n,  and  recover  the 
form  (13.1).  Simple  examples  of  this  technique  (for  n  =  2)  will  be  given 
below. 

In  (13.1)  the  functions  Fj  contained  within  the  phase  velocity  F  may  not 
depend  explicitly  on  time,  and  the  system  is  then  said  to  be  autonomous. 
Of  course,  it  is  immediately  apparent  that  any  non-autonomous  system 
may  be  made  autonomous  through  the  substitution  xn+\  =  t,  but  at  the 
expense  of  raising  by  one  the  dimension  of  the  phase  space.  We  will  not 
be  concerned  further  with  this  matter  here,  but  will  concentrate  almost 
exclusively  on  autonomous  systems  [cf.  also  'natural  systems'  of  §10.1]. 

Evidently  the  phase  portrait  for  a  particular  dynamical  system  may  look 
very  different  for  different  values  of  the  control  parameters  c  in  (13.1)  and, 
for  many  such  systems,  there  is  considerable  interest  in  the  dynamical  tran- 
sitions which  occur  in  their  behaviour  at  particular  parameter  values.  The 
importance  of  this  lies  in  what  is  called  the  structural  stability  of  systems; 
abrupt  changes  at  particular  c  are  bifurcations  or  catastrophes. 

13.2    First-order  Systems  —  the  Phase  Line  (n  =  1) 

Let  us  consider  the  logistic  differential  equation  for  the  dynamic  state  vari- 
able x(t): 


Here  k,  a  are  positive  control  parameters. 

The  equation,  which  is  due  to  Verhulst,  is  a  model  of  population  growth 
(among  many  other  applications)  and  the  general  motivation  is  that  the 
growth  rate  x/x  decreases  as  the  population  x(t)  increases  on  account  of 
overcrowding  or  lack  of  resources.  Naturally  the  interest  for  this  biological 
application  is  in  xq  >  0. 


(13.2) 


with 


x(0)  =  X0. 


(13.3) 
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Now  this  equation  (13.3)  is  first-order  separable  and  it  may  be  solved 
explicitly  to  give 

x(t)  =  7  r,  ,  ; — r"T7>  f°r  xo  >  0, 

v  '      [ax0  +  (k-  <Tx0)eM-kt)}  (13.4) 

x(t)  =  0,  for  xq  =  0. 

This  solution  may  be  plotted  as  a  function  of  t  as  in  Fig.  13.2. 

It  is  evident  from  (13.4)  and  from  Fig.  13.2  that,  for  all  xq  >  0,  the 
evolution  of  x  with  t  is  such  that  x  — >  k/a.  For  0  <  xq  <  k/a  the  'sigmoid' 
curves  obtained  represent  the  way  in  which  the  exponential  growth  (which 
would  be  present  if  a  =  0)  is  inhibited  by  overcrowding.  For  xq  >  k/a  there 
is  steady  decay  of  the  population  towards  the  value  k/a.  Since  x  — >  k/a 
as  t  — >  oo  in  each  case,  then  the  equilibrium  population  x(t)  =  k/a  is 
said  to  be  asymptotically  stable  and  it  is  an  attractor.  The  equilibrium 
population  x{t)  =  0  is  unstable,  in  that  it  is  only  realizable  if  xq  =  0.  Any 
positive  change  in  xq,  however  small,  results  in  explosive  growth  until  the 
overcrowding  takes  effect  and  x  is  attracted  towards  k/a,  as  above.  So 
x(t)  =  0  is  a  repeller. 

Now  here  we  have  been  fortunate  to  be  able  to  arrive  at  the  complete 
solution  of  (13.3)  in  the  form  (13.4).  The  comments  detailed  above  result 
from  interpretation  of  (13.4)  and  of  Fig.  13.2  derived  from  it.  However,  the 
important  qualitative  behaviour  of  the  solutions  is  obtainable  more  directly 
from  (13.3)  by  considering  the  phase  space  and  the  phase  portrait. 


Fig.  13.2 
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Example:  Phase  portrait  of  the  logistic  system 

Find  the  qualitative  features  of  the  logistic  system  by  considering 
the  phase  portrait. 

The  phase  space  is  here  one-dimensional  (a  phase  line).  (See  Fig.  13.3.) 
In  constructing  the  phase  portrait  for  this  example  the  equilibria  x  =  0, 
x  =  k/a  are  found  by  putting  x  =  0  in  (13.3).  It  is  important  to  note 
that  these  equilibria,  or  critical  points,  are  the  only  values  of  x  at  which 
x  can  change  sign.  The  arrow  directions  are  those  of  the  phase  velocity 
and,  here,  the  complete  portrait  can  be  found  by  testing  just  a  single 
x  value  away  from  the  equilibria.  From  this  we  see  immediately  that 
x  =  k/a  is  an  attractor  and  x  =  0  is  a  repeller. 


O  klu  x 

Fig.  13.3 

The  above  is  sufficient  argument,  but  with  an  eye  to  developments  for 
higher-order  systems  we  can  consider  a  small-amplitude  approximation 
near  to  each  critical  point,  very  similar  to  the  analysis  carried  out  in,  for 
example,  §2.2.  If  we  write  x{t)  =  k/a  +  £(£),  where  £(£)  is  small  enough 
for  quadratic  terms  involving  it  to  be  neglected,  then  we  find  that 

e  =  -fc£  (i3.5) 

to  first  order  in  £  and  £(t)  — >  0  exponentially  as  t  — >  oo,  since  k  is 
positive,  confirming  of  course  that  x  =  k/a  is  an  attractor.  Note  that 
this  solution  of  (13.5)  is  consistent  with  our  assumption  that  quadratic 
terms  in  £  could  be  neglected.  The  other  critical  point  x  =  0  may  be 
treated  similarly  and  confirmed  as  a  repeller. 

An  important  point  to  note  here  is  that  for  this  system  and  for  other 
such  first-order  systems  the  qualitative  behaviour  is  completely  determined 
by  the  position  of  the  critical  points,  by  the  local  behaviour  on  the  phase 
line  near  to  them  and  through  the  strong  geometrical  requirement  that  x 
cannot  change  sign  at  any  other  value  of  x. 

We  might  consider  the  equation 


x  =  kx  —  ax2  —  /, 


(13.6) 
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where  /  represents  a  positive  constant  extraction  rate.  For  example,  x(t) 
could  be  the  population  of  fish  in  a  lake  and  /  the  rate  of  fishing  this 
population.  The  exact  solution  is  now  rather  more  awkward  to  obtain  and 
the  use  of  the  one-dimensional  phase  space  provides  clear  advantage.  In 
particular,  the  extra  control  parameter  /  which  has  been  introduced  can 
be  varied  and  predictions  made  about  the  evolution  of  x  from  resulting 
changes  in  the  phase  portrait.  (See  Problem  1.) 

The  analysis  given  above  will  be  generalized  in  following  sections  to  deal 
with  higher-order  systems. 

Naturally  for  many  biological  systems  a  discrete,  rather  than  continuous, 
model  is  more  appropriate  [see  Appendix  D]. 

13.3    Second-order  Systems  —  the  Phase  Plane  (n  =  2) 

Again  let  us  proceed  by  considering  a  particular  and  familiar  example. 

Example:  Simple  harmonic  oscillator 

Determine  the  phase-space  trajectories  of  the  simple  harmonic 
oscillator. 

The  oscillator  equation  (2.13)  may  be  written 


x,  =  —uj2x,       u)  =  y/k/m,  (13.7) 

which  is  in  the  form  (13.2)  and  which  may  be  converted  to  the  form 
(13.1)  as 

x  =  y, 

(13-8) 

y  =  —ujx. 

The  trajectories  of  this  autonomous  system  in  the  (x,  y)  phase  plane  are 
evidently  given  by  the  solutions  of 

aV      V        lo2x  .„  „  . 

-p  =  -  =  ,  13-9 

ax      x  y 

which  may  be  solved  easily  to  give 

WV  +  y2  =  A2,        with       A2  constant.  (13.10) 

The  phase  portrait  is  a  family  of  ellipses  centred  at  the  origin.  (See 
Fig.  13.4.)  Here,  along  each  trajectory  the  energy  E  of  the  oscillator  is 
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Fig.  13.4 

constant  and  given  by  E  =  \mA2.  Evidently  (x,y)  =  (0,0)  is  a  critical 
point  of  the  system  (13.8)  and  any  small  perturbation  leads  to  the  system 
traversing  an  elliptical  trajectory  around  this  equilibrium  point.  Each 
trajectory  is  closed  and  is  therefore  a  periodic  orbit. 


In  this  case,  we  were  able  to  find  the  trajectories  by  direct  integration  of 
(13.9).  In  the  case  of  a  general  system  this  may  well  not  be  easy,  or  possible, 
except  by  numerical  computation. 

The  general  second-order  autonomous  system  with  which  we  are  pre- 
sented is 


x  =  F(x,y), 
V  =  G(x,y), 


(13.11) 


where  F,  G  are  suitable  functions.  It  is  apparent  that  the  equation 


dy 
dx 


G 
F 


(13.12) 


certainly  determines  the  slope  of  the  unique  trajectory  in  the  phase  plane 
at  each  point  except  at  the  critical  points  (xo,yo),  for  which  F(xo,yo)  = 
0  =  G(xo,yo).  Each  such  (xo,yo)  is  an  equilibrium  solution  of  the  system 
and  trajectories  can  only  intersect  at  such  points  and  nowhere  else. 

The  interesting  question  to  ask,  from  the  physical  point  of  view,  con- 
cerns the  fate  of  the  system  when  it  is  displaced  slightly  from  such  a  critical 
point.  The  system  could,  for  general  small  displacements: 
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1.  Move  so  as  to  tend  towards  (xq,  yo)  as  t  — >  oo,  in  which  case  we  say  the 
system  is  asymptotically  stable. 

2.  Move  increasingly  away  from  (xo,  yo),  in  which  case  we  say  it  is  unstable. 

3.  Move  in  an  arbitrarily  small  neighbourhood  of  (xo,yo)  without  neces- 
sarily tending  towards  (xo,yo)  as  t  — >  oo,  in  which  case  we  say  it  is 
stable. 

The  simple  harmonic  oscillator  is  therefore  stable  at  a;  =  0,  y  =  x  =  0  — 
hardly  a  surprising  conclusion! 

The  behaviour  of  the  trajectories  near  to  each  critical  point  (xo,yo)  in 
the  phase  plane  can  be  found  by  an  analysis  similar  to  that  introduced  in 
§13.2  using  x  =  xo  +  £,  y  =  yo  +  rj,  in  which  £,  r]  are  small  and  we  expand 
functions  F(x,y),G(x,y)  locally.  We  obtain 


+  T) 


\dxj0  \dyj0 

(dG_ 

\dx 


dG 
dy 


(13.13) 


Here  the  partial  derivatives  are  evaluated  at  the  critical  point  (xo,yo)  and 
F±,Gi  are  generally  quadratic  in  £, rj.  In  matrix  notation,  these  equations 
are 


~i~ 

d 

=  M 

=  ~At 

+  higher-order  terms, 


(13.14) 


and  the  2x2  Jacobian  matrix  M  has  constant  entries  which  are  special  to 
each  critical  point.  In  nearly  all  cases  the  correct  behaviour  of  the  almost 
linear  system  near  £  =  0  =  r/  is  given  by  the  behaviour  of  the  linear  system 
which  is  obtained  by  neglecting  the  higher-order  terms  F\,  G\  in  the  above. 
The  behaviour  near  (xo,yo)  of  the  linear  system  may  be  classified  using 
the  eigenvalues  A  of  M.  We  seek  a  solution  of  the  linear  system  which  is  a 
linear  combination  of  'modes',  in  each  of  which  f ,  r/  are  constant  multiples  of 
eAt.  The  analysis  appears  similar  to  that  involved  in  the  calculation  of  the 
frequencies  of  the  normal  modes  in  §11.3,  but  we  are  dealing  with  matrices 
M  which  are  not  necessarily  symmetric,  so  that  the  roots  of  the  quadratic 
characteristic  equation  for  A  need  not  be  real.  Evidently,  the  signs  of  the 
real  parts  of  these  roots  are  crucial  in  determining  whether  trajectories  near 
to  (xo,yo)  go  inwards  towards  this  critical  point  or  go  outwards  from  it. 
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Details  of  the  classification  of  critical  points  in  the  various  cases  are  given 
in  Appendix  C. 

The  curves  which  go  directly  in  and  directly  out  of  a  critical  point  are 
called  the  stable  and  unstable  manifolds  respectively  and,  in  the  linear  and 
almost  linear  systems,  they  correspond  directly  and  are  tangent  to  one 
another.  We  note  that  the  critical  point  (xo,yo)  =  (0)0)  for  the  simple 
harmonic  oscillator  is  a  centre,  i.e.  nearby  trajectories  form  closed  curves 
around  it.  (See  Appendix  C,  Fig.  C.6.) 

The  importance  of  the  local  behaviours  near  critical  points  of  system 
(13.11)  is  that  we  can  usually  obtain  a  qualitative  picture  of  the  full  phase 
portrait  of  the  system  by  this  means.  The  trajectories  are  constrained  in 
that  they  cannot  intersect  elsewhere  and  they  are  locally  parallel  to  the  x,  y 
axes  respectively  when  crossing  the  curves  G{x,  y)  =  0,  F(x,  y)  =  0. 

The  fate  of  particular  initial  states  becomes  apparent  from  the  phase 
portrait.  Detailed  quantitative  information  can,  if  required,  be  obtained 
from  exact  computational  plots. 

Apart  from  systems  for  which  (13.12)  may  be  integrated  in  straightfor- 
ward fashion  to  give  the  curve  family  f(x,y)  =  constant,  there  are  some 
special  cases  of  particular  interest  in  mechanics. 


Conservative  systems  with  one  degree  of  freedom 

For  the  systems  considered  in  §2.1  we  have 

F  ...     „  dV(x) 


x  =  y, 


y 


with  F 


m 


dx 


and  we  find  the  trajectories  are  given  by 


\my2  +  V(x)  =  E,        y  =  ±\j ^-[E -V{x)]. 


(13.15) 


(13.16) 


When  the  potential  V(x)  is  known  the  phase  portrait  may  be  sketched 
immediately.  (See  Fig.  13.5.) 
Since 


M 


0  1 

V"(xq)  0 


at  equilibria  (xo,0),  the  eigenvalues  of  M  when  V(x)  is  a  local  minimum 
are  pure  imaginary,  since  V"(xq)  is  positive,  and  (2:0,  0)  is  a  centre,  which 
is  stable.    Similarly,  local  maxima  of  V(x)  lead  to  saddles  in  the  phase 
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(b)  Phase  portrait 

Fig.  13.5 


portrait  and  these  are  unstable,  i.e.  most  trajectories  approach  the  critical 
point,  but  then  turn  away.  (See  Appendix  C,  Fig.  C.2.) 

An  important  physical  example  is  provided  by  the  simple  pendulum  of 
§2.1. 

Example:  Simple  pendulum 

Determine  the  possible  types  of  motion  of  a  simple  pendulum  by 
considering  the  phase  portrait. 

From  the  27r-periodic  potential  function  V(8)  sketched  in  Fig.  2.1  we 
can  sketch  the  phase  portrait.  (See  Fig.  13.6.) 

The  phase  portrait  is  also  27r-periodic.  The  simple  oscillatory  mo- 
tions corresponding  to  closed  orbits  around  (0,  0)  are  called  librations 
and,  since  V(6)  =  mgl(l  —  cos 9),  it  is  necessary  that  E  <  2mgl  for  these 
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e 


Fig.  13.6 


to  occur  in  Fig.  2.1).  If  E  >  2mgZ  (E(2)  in  Fig.  2.1)  the  pendu- 

lum makes  complete  revolutions  or  rotations,  either  in  the  positive  or 
negative  sense.  In  the  special  case  E  =  2mgl  the  trajectory  is  a  separa- 
trix  connecting  the  saddles  (— n,  0),  (n,  0),  and  corresponds  to  motions 
of  the  pendulum  in  which  it  is  vertically  above  the  point  of  support  at 
t  — >  ±oo.  The  period  of  libration  motion  of  the  pendulum  increases  with 
increasing  amplitude  and  becomes  infinite  as  the  phase-plane  trajectory 
corresponds  more  and  more  closely  to  the  separatrices  from  the  inside. 

It  should  be  noted  that  (13.15)  in  general,  and  hence  the  simple  har- 
monic oscillator  and  simple  pendulum  in  particular,  are  Hamiltonian  sys- 
tems with  one  degree  of  freedom.  For  such  systems  the  critical  points  in 
the  phase  plane  are  either  centres  or  saddles  —  spirals  and  nodes  do  not 
occur! 

Gradient  systems 

These  systems  have  equations  of  the  form 


x  =  — 


dU(x,y) 
dx 


y 


dU(x,y) 
dy 


(13.17) 


so  that  in  (13.1)  F  takes  the  form  —  Vf7.  The  trajectories  in  the  phase 
plane  are  everywhere  perpendicular  to  the  curves  U(x,y)  =  constant,  i.e. 
they  are  orthogonal  trajectories  of  the  level  curves  (contours)  of  U.  Here 
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almost  all  trajectories  tend  towards  local  minima  of  the  function  U(x,y) 
and  away  from  local  maxima.  For  these  systems  the  critical  points  in  the 
phase  plane  are  nodes  and  saddles  —  spirals  and  centres  do  not  occur. 

A  physical  example  of  a  gradient  system  is  the  'runoff'  problem  of  water 
flowing  downhill  through  surface  soil.  (See  Fig.  13.7  and  Problem  5.) 


Fig.  13.7 


13.4    Prey— Predator,  Competing-species  Systems  and  War 

The  general  method  of  analysis  in  the  phase  plane  introduced  in  §13.3  finds 
very  wide  application. 

An  interesting  example,  which  is  deceptively  simple,  leads  to  the  equa- 
tions 

x  =  ax  —  bxy, 
y  =  -cy  +  dxy, 

where  a,  b,  c,  d  are  positive  constants.  These  equations  have  been  used  by 
Volterra  (1926)  to  model  biological  populations  and  by  Lotka  (1920)  to 
model  chemical  reactions.  They  are  usually  known  as  the  Lotka-Volterra 
system. 


(13.18) 
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The  biological  application  is  to  a  simple  ecological  model  of  population 
dynamics,  where  x,y  are  prey,  predators  respectively  (e.g.  rabbits,  foxes) 
and  the  terms  proportional  to  xy  model  the  effect  of  interaction  between 
these  species  through  encounters  —  advantageous  to  the  predators,  disad- 
vantageous to  the  prey.  Without  the  interactions  the  prey  would  proliferate 
exponentially  in  time,  while  the  predators  would  die  out.  The  parameters 
a,  b,  c,  d  are  therefore  rates  of  growth,  decay  and  competitive  efficiency. 

The  ecology,  even  of  such  isolated  sets  of  species,  is,  of  course,  not  this 
simple,  but  the  consequences  of  idealized  models  prove  useful  as  a  basis  for 
more  extensive  investigations. 

Example:  Lotka— Volterra  system 

Find  the  critical  points  and  their  nature  for  the  Lotka- Volterra 
system. 

In  the  local  linear  analysis  there  are  two  critical  points  for  the  system 
(13.18): 


(0,  0)  with  M 


a  0 
0  -c 

We  have  a  saddle  with  eigenvectors  parallel  to  the  x,  y  co-ordinate 


so  that  the  eigenvalues  are  Ai  =  a,  A2  =  —a 


axes. 

be 


with  M 


0 


so  that  the  eigenvalues  are  imaginary: 


ad 

_T  0 

A1.2  =  ±i\/ac.  We  have  a  centre. 

The  question  arises  as  to  whether  the  centre  is  a  true  centre  for  the  full 
system  in  this  case  (see  Appendix  C  and  Problem  6).  That  it  is  such  is 
demonstrated  by  the  realization  that  (13.18)  leads  to 

¥  =  y±^±,  (i3.i9) 

da;       x(a  —  by) 
which  is  separable,  so  that  it  may  be  integrated  to  give 

f(x,  y)  =  — cln  \x\  +  dx  —  a  In  \y\  +  by  =  constant.  (13.20) 

The  phase  portrait  may  be  sketched  as  the  set  of  contours  of  f(x,y). 
(See  Fig.  13.8.)  Naturally,  the  biological  interest  is  confined  to  the  first 
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Fig.  13.8 


quadrant  (x  >0,y>  0)  and  the  trajectories  are  closed  curves  around  the 
critical  point  at  (c/rf,  a/fe),  so  confirming  the  local  linear  analysis  that 
this  is  indeed  a  centre. 


Small  oscillations  around  the  point  of  stable  equilibrium  have  frequency 
sfac  and  period  T  =  2ir / \/ac,  which  increases  with  the  size  of  the  orbit, 
although  an  explicit  analytic  demonstration  of  this  is  not  easy  and  will  not 
be  given  here.  However,  since 

£  (^jdt=  f\a-by)dt=  [\nx(t)]*  =  0,  (13.21) 


we  find  that 

l-T 


a 

ydt  =  -b, 


so  that  the  mean  value  of  y  around  any  periodic  orbit  is  given  by  the 
equilibrium  value  a/b.  Similarly  x  =  c/d. 
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There  are  ecological  consequences  of  this  model  in  that  there  are  cyclic 
variations  in  x,  y,  which  are  not  in  phase,  and  changes  in  parameter  values 
can  be  investigated.  The  effect  here  of  a  proportional  extraction  rate  is  to 
reduce  a  and  increase  c,  so  that  the  mean  populations  of  prey /predators 
respectively  increase/decrease  —  the  paradoxical  advantage  to  the  prey  of 
this  extraction  is  due  to  the  removal  of  some  natural  predators,  and  is 
known  as  Volterra's  principle. 

The  model  has  one  further  interest  for  us  in  that  it  is,  in  effect,  a 
Hamiltonian  system,  as  detailed  in  Chapter  12.  For  the  first  quadrant  we 
may  substitute  x  =  ep,  y  =  eq  in  order  to  obtain  the  Hamilton  canonical 
form 

dH 

q  =  -c  +  dep  =  — , 
dp 

13.22 

dH  V  ' 

p  =  a-beq  =  — — , 
oq 

where  H(q,p)  =  —cp  +  dep  —  aq  +  beq.  For  this  autonomous  system,  H  is  a 
conserved  quantity.  (See  §12.2.)  In  the  original  variables  x,  y  the  function 
H(q,p)  becomes  equivalent  to  f(x,y)  in  (13.20)  and  it  is  an  'energy',  but 
not  mechanical  energy. 

More  complicated  systems  involving  interactions  are  very  common  and 
the  outcomes  are  even  less  intuitively  transparent. 

For  example 

x  =  k\  x  —  en  x2  —  aixy, 

1  2  (13.23) 

y  =  k2y-  <T2y  -  a2xy, 

where  fci,  k2,  &i,  <j2,  «i,  ol2  are  positive  control  parameters,  models  logistic 
behaviour  (see  (13.3))  for  each  of  x,  y  with  similarly  destructive  interactions. 
This  system  is  certainly  not  usually  Hamiltonian,  but  the  phase  portrait 
is  accessible  by  the  local  linear  analysis  near  the  critical  points,  of  which 
there  may  be  four.  The  influence  of  the  parameters  is  crucial.  An  example 
of  such  a  portrait  is  given  in  Fig.  13.9,  for  which  the  parameters  have  been 
chosen  to  be  k\  =  G\  =  a.\  =  1,  k2  =  |,  cr2  =  \,  a2  =  |.  (See  Problem  7  for 
the  details.) 

Here  there  are  two  asymptotically  stable  nodes  at  (1,0),  (0,2),  an  un- 
stable node  at  (0,0)  and  a  saddle  (unstable)  at  (j,|).  Nearly  all  initial 
conditions  lead  to  extinction  for  one  or  other  species  and  this  is  an  example 
of  the  'Principle  of  Competitive  Exclusion'  [Charles  Darwin,  The  Origin  of 
Species  (1859)]. 
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Fig.  13.9 

In  the  general  case  (13.23)  and  where  there  is  a  critical  point  in  the 
first  quadrant,  an  extinction  is  inevitable  when  k\Oi  <  foai,  fc20"i  <  fci<ai2 
(so  that  cri(T2  <  «itt2)-  On  the  other  hand,  peaceful  coexistence  via  an 
asymptotically  stable  node  in  the  first  quadrant  is  guaranteed  when  these 
inequalities  are  all  reversed  (again,  see  Problem  7). 

In  human  conflict  there  are  mathematical  theories  of  war.  The  simplest 
such  model  involves  forces  x(t),y(t)  which  are  effectively  isolated  apart  from 
the  confrontation  between  them. 

Example:  Combat  model 

The  simple  system 

x  =  —ay, 

(13.24) 

y  =  -bx, 

where  a,  b  are  positive  constants,  models  the  attrition  rates  of 
each  force  on  the  other.  Determine  how  the  final  outcome  of  a 
conflict  depends  on  the  initial  conditions  in  this  model. 
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Evidently  (0,  0)  is  the  only  critical  point  and  it  is  a  saddle.  Indeed 
the  trajectories  are  given  by 

ay2  -  bx2  =  ay2  -  bx20  =  K,  (13.25) 

where  xq,  yo  are  the  initial  strengths  at  t  =  0.  These  are  hyperbolic  arcs 
in  general  (see  Fig.  13.10). 


Fig.  13.10 


Of  course  x  'wins'  if  K  <  0  and  y  'wins'  if  K  >  0.  One  interesting 
feature  is  that  the  trajectories  diverge  —  the  discrepancy  between  the 
two  sides  becomes  more  marked  as  time  progresses,  so  the  likely  outcome 
becomes  clearer  and  the  weaker  party  may  be  induced  to  surrender.  The 
most  damaging  conflicts  are  those  that  start  near  the  line  K  =  0  which 
leads  to  a  'draw'  in  which  both  sides  are  completely  annihilated,  since  in 
this  case  the  forces  were  initially  perfectly  balanced. 

The  measure  of  fighting  effectiveness  of  e.g.  force  y  as  ay2,  at  t  =  0, 
i.  e.  quadratic  in  numerical  strength,  is  Lanchester's  Law  of  Conventional 
Combat  (1916).  Through  the  ages,  the  general  success  of  the  empirical 
rule  'Divide  and  conquer!'  is  confirmation  of  the  effect  of  this  simple 
qualitative  law. 
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One  may  also  model  not  the  conflict  itself  but  the  arms  race  leading  up 
to  it.  Richardson  (1939)  introduced  such  a  model  in  the  form 


Here  x(t),y(t)  are  war  potentials/armaments  of  two  nations/coalitions, 
(ai,a2)  are  response  parameters  to  the  armament  of  the  opponent  (note 
that  in  contrast  to  the  example  above,  these  coefficients  enter  with  pos- 
itive sign),  (ci,c2)  are  cost  deterrents  and  g±,g2  basic  grievances.  (See 
Problem  8.) 

13.5    Limit  Cycles 

So  far  we  have  looked  at  phase  portraits  in  which  the  attractors  and  repellers 
have  been  equilibria  which  are  isolated  critical  points.  For  some  systems 
there  are  other  possibilities.  Rayleigh's  equation  (1883) 


where  e  is  a  positive  control  parameter,  has  been  used  to  model  various  self- 
excited  oscillations,  for  example  the  bowing  of  a  violin  string.  By  reference 
to  the  damped  oscillator  of  §2.5  it  is  evident  that  (13.27)  has  positive  or 
negative  damping  respectively  according  as  \x\  >  1  or  \x\  <  1.  The  sort 
of  physical  system  modelled  by  this  equation  is  illustrated  schematically  in 
Fig.  13.11,  where  a  block  is  acted  upon  by  a  spring  and  by  a  dry  frictional 
force  at  contact  with  a  belt,  which  is  itself  moving  uniformly.  We  may  write 


x  =  a2y  -  cix  +  gi, 
y  =  a1x-  c2y  +  g2. 


(13.26) 


x  —  ex(l  —  x2)  +  x  =  0, 


(13.27) 


Moving  belt 


Fig.  13.11 


(13.27)  in  the  form 


x  =  y, 

y  =  -x  +  ey(l  -  y2). 


(13.28) 
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The  only  critical  point  is  at  (0,0)  and  is  always  unstable  (spiral  or  node). 
However  trajectories  come  inwards  from  large  values  of  x  and  y.  The  phase 
portraits  for  progressive  e  values  are  sketched  in  Fig.  13.12. 


Fig.  13.12 

In  each  case  there  is  a  periodic  orbit  attractor,  which  is  an  asymptoti- 
cally stable  limit  cycle.  Other  trajectories  tend  towards  this  cycle  as  t  — >  oo 
and  the  form  of  the  cycle  depends  crucially  on  the  parameter  e.  As  e  in- 
creases, the  variation  of  x  with  time  t  changes  from  essentially  sinusoidal 
in  Fig.  13.12(a)  to  a  markedly  non-symmetrical  oscillation  in  Fig.  13.12(c). 
There  is  then  a  sharp  rise  and  fall  in  x  as  t  varies  (see  Fig.  13.13(a)).  For 
this  type  of  oscillation,  which  is  called  a  relaxation  oscillation,  there  is  a 
mechanical  analogy  in  the  seesaw  mechanism  (see  Fig.  13.13(b)).  Here  liq- 
uid is  added  at  a  constant  rate  to  the  reservoir  on  the  right-hand  side  and 
periodically  tips  the  balance,  so  that  the  reservoir  is  emptied  suddenly  and 
the  cycle  can  be  repeated. 
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Fig.  13.13 


There  are  many  mechanical  and  electrical  examples  of  Rayleigh's  equa- 
tion and  of  another  which  may  be  derived  from  it  by  the  substitution 
x  =  v/y/3.  The  equation  for  v  is  then  (see  Problem  10) 


ev{l  -  v2)+v  =  0. 


(13.29) 


This  equation  was  studied  by  Van  der  Pol  (1926)  in  order  to  model  various 
electrical  circuits  and  in  his  study  of  the  heartbeat. 

A  spectacular  example  of  a  limit  cycle  was  in  the  behaviour  of  the 
Tacoma  Narrows  bridge  near  Seattle.  The  cycle  of  torsional  oscillations, 
up  to  1.5  m  in  the  bridge  deck,  drew  its  energy  from  a  constant  wind  flow 
across  it.  The  bridge  became  known  as  'Galloping  Gertie',  at  least  up  to 
its  spectacular  collapse  on  7  November  1940,  fortunately  and  memorably 
recorded  on  film. 

Relaxation  oscillations  have  also  been  used  to  model  natural  phenom- 
ena, like  geysers  and  earthquakes. 

Analytic  demonstration  of  the  existence  of  limit  cycles,  both  attractors 
and  repellers,  is  beyond  our  scope  here,  but  it  is  important  to  note  that  some 
systems  exhibit  bifurcation  phenomena  as  a  control  parameter  is  varied. 


Hopf  bifurcation 

At  the  end  of  §13.1  we  referred  to  the  way  in  which  critical  points  and 
phase  portraits  may  have  very  different  character  when  control  parameters 
are  varied.  Consider  now  that  the  stability  of  a  focus  can  change  as  a  control 
parameter  c  is  varied,  in  such  a  way  that  the  system  dynamics  'gives  birth' 
to  a  limit  cycle.    This  is  usually  known  as  a  Hopf  bifurcation  and  it  is 
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y 


(a)  Supercritical  Hopf  bifurcation 


y 


(b)  Subcritical  Hopf  bifurcation 


Fig.  13.14 

termed  supercritical  or  subcritical  respectively  according  as  the  resulting 
limit  cycle  is  an  attractor  or  repeller.  For  the  supercritical  bifurcation  an 
asymptotically  stable  focus  becomes  unstable  at  a  critical  value  cq  at  which 
an  attracting  limit  cycle  appears.  (See  Fig.  13.14(a).)  For  the  subcritical 
bifurcation  an  unstable  focus  becomes  asymptotically  stable  at  the  critical 
value  of  c,  at  which  a  repelling  limit  cycle  appears.  (See  Fig.  13.14(b).) 

The  effect  of  a  supercritical  Hopf  bifurcation  is  relatively  benign  ('soft') 
in  that  the  change  results  in  the  system  remaining  very  much  in  the  same 
neighbourhood  of  the  phase  plane  when  it  is  subject  to  a  perturbation. 
However,  near  a  subcritical  Hopf  bifurcation  even  a  small  perturbation  can 
take  the  system  outside  the  repelling  limit  cycle  and  then  far  away  in  the 
phase  plane,  despite  the  asymptotic  stability  of  the  critical  point  in  the 
linear  analysis  —  the  effect  is  'hard'. 

The  'Brusselator'  is  a  simple  model  of  chemical  oscillation  proposed  by 
Prigogine  and  Lefever  (1968). 
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,  (13.30) 


Example:  The  'Brusselator' 

The  chemical  concentrations  x(t),  y(t)  of  two  particular  reactants 
are  related  by  the  equations: 

x  =  a  —  (1  +  b)x  +  x2y 
y  =  bx  —  x2y. 

Here  a,  b  are  concentrations  of  other  reactants  which,  once  cho- 
sen, are  kept  constant,  with  a,  b  both  positive.  Find  the  nature 
of  the  attractors  for  this  model. 

There  is  only  one  critical  point,  at  (a,b/a),  with  Jacobian  matrix 

-1  a2 
-b  -a1 

In  consequence  the  critical  point  is  asymptotically  stable  when  b  <  1+a2. 
When  b  >  1  +  a2  the  critical  point  is  unstable  (a  spiral  or  node  according 
as  b  <  or  >  (1  +  a)2).  At  b  =  1  +  a2  there  is  a  supercritical  Hopf 
bifurcation,  so  that  the  unstable  critical  point  is  accompanied  by  a  limit- 
cycle  attractor  (see  Fig.  13.15,  where  a  =  1, 6  =  3). 


Fig.  13.15    Brusselator  limit  cycle  attractor  (a  =  1,  b  =  3). 
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13.6    Systems  of  Third  (and  Higher)  Order 

We  have  seen  that,  for  first-order  systems,  there  are  point  attractors  and  re- 
pellers  and,  for  second-order  systems,  there  may  also  be  limit  cycles.  Many 
higher-order  systems  are  special,  in  that  their  dynamical  behaviour  is  effec- 
tively lower-dimensional,  through,  for  example,  the  existence  of  constants 
of  the  motion.  Some  important  specific  examples  of  this  will  be  given  in 
Chapter  14  for  systems  of  fourth  order  and  beyond,  but  in  this  section  we 
will  consider  third-order  systems  which  exhibit  certain  properties  typical  of 
systems  of  order  higher  than  second. 


Rigid-body  rotation 

Consider  the  equations  of  rotational  motion  of  a  rigid  body,  which  were 
introduced  in  §9.8.  When  the  rotation  is  about  the  centre  of  mass  and 
external  forces  have  zero  moment  we  may,  for  convenience,  choose  to  ex- 
press the  equations  (9.41)  in  terms  of  angular  momentum  components 
(Ji,  J2,  J3)  =  (Iiui,hv2,hv3)  to  obtain 

Ji  -  (%^)  J2  J'i  =  0  (13.31) 

and  two  similar  equations  obtained  by  cyclic  interchange  of  1,2,3.  In  the 
(Ji,  J2,  J3)  phase  space  each  trajectory  is  then  the  intersection  of  two  sur- 
faces (see  Problem  12) 

sphere:        J\  +  Jf  +  Jf 

j2        J2  J2 

ellipsoid:   ~r  +  ~r  +  ~r 
h 

with  J,  T  constant.  The  first  of  these  relations  expresses  the  constancy  of 
the  magnitude  of  the  angular  momentum  vector  J  and  the  second  expresses 
the  constancy  of  the  kinetic  energy  T.  (See  Fig.  13.16,  with  principal 
moments  of  inertia  such  that  I\  <  I2  <  I3.) 

If  we  take  J  to  be  fixed  and  look  at  the  critical  points  on  the  correspond- 
ing phase  sphere,  it  is  evident  that  Fig  13.16  confirms  the  results  obtained 
in  §9.8,  since  the  saddle-point  geometry  leading  to  instability  is  at  the  axis 
of  the  intermediate  principal  moment  of  inertia,  while  the  centres  are  at  the 
axes  of  the  largest  and  the  smallest  moment.  The  result  holds,  of  course, 
for  all  rigid  bodies  (from  a  match-box  to  a  space  station),  but  it  is  often 
known  as  the  tennis  racquet  theorem. 


2T, 


(13.32) 
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Fig.  13.16 

When  there  is  axial  symmetry  of  the  rigid  body,  as  in  §9.9,  there  is  sim- 
ilar symmetry  in  the  phase  space.  (See  Fig.  13.17,  with  principal  moments 
of  inertia  such  that  h  =  h  <  h-) 

In  this  case  J3  is  constant  so  that  J3  =  I3Q  (say),  while  the  components 
Ji,  J2  of  angular  momentum  are  simple  harmonic  with  frequency  [(73  — 
7i)/7i]f2.  Relative  to  axes  fixed  in  space  J  is  itself  fixed  and  the  angular 
velocity  vector  u>  precesses  around  J  in  the  way  described  in  §9.9.  In 
the  case  Wi,W2  *C  O  then  this  precession  rate  is  given  approximately  by 
(73/7i)0.  Hence  the  precession  rate  for  the  Chandler  wobble  of  the  Earth 
is  about  O  (i.e.  once  per  day),  while  the  precession  rate  is  about  |0  for  a 
spun  pass  with  an  American  football  and  about  20  for  a  spun  coin,  discus 
or  dinner  plate! 

The  problem  of  rotational  motion,  when  the  external  force  moment  G  is 
not  special,  can  be  very  complicated,  although  it  is  of  course  an  important 
one  for  many  feedback  control  problems  in,  for  example,  the  aerospace 
industry.  Some  of  the  complexity  which  may  result  is  indicated  by  the 
following  examples  of  other  physical  systems. 
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Fig.  13.17 


The  Lorenz  system 

The  system  of  equations 


x  =  a(y-  x), 

y  =  px  -  y  -  xz, 

z  =  —j3z  +  xy, 


(13.33) 


where  a,  p,  f3  are  positive  control  parameters,  is  an  important  system  which 
was  originally  put  forward  by  Lorenz  in  1963  to  provide  a  very  simple  and 
idealized  model  of  convection  in  a  slab  of  fluid.  The  aim  was  to  gain  insight 
into  the  dynamics  of  weather  systems.  (See  Fig.  13.18.) 

For  a  general  discussion  of  the  origins  of  this  system,  see  Lorenz,  The 
Essence  of  Chaos,  UCL  Press  (1993). 

The  parameters  have  the  following  physical  meaning,  after  scaling: 

•  a  quantifies  the  ratio  of  diffusion  rates  of  momentum  and  heat  in  the 
fluid  (a  =  kinematic  viscosity /thermal  conductivity). 

•  p  quantifies  the  temperature  difference  applied  across  the  slab  of  fluid. 

•  f3  quantifies  the  slab  geometry  through  an  aspect  ratio. 
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T=T„ 

/////////////////;//////;///////////////// 


8 


Slab  of 
fluid 


77777777777777777777777777777777777777777, 
T=T{  r,-T0  maintained  constant 

positive  temperature  difference 


Fig.  13.18 


Despite  its  geophysical  origin  the  system  (13.33)  and  its  wide  range  of 
behaviours  have  been  the  subject  of  extensive  study,  particularly  because 
the  system  finds  application  in  other  physical  contexts,  for  example  in  the 
study  of  lasers. 

It  is  instructive  (from  both  a  practical  and  a  theoretical  point  of  view) 
to  look  at  the  (x,  y,  z)  phase  space,  say  with  a,  (5  fixed  and  with  a  sequence 
of  different  values  of  p.  We  can  find  the  critical  points  of  the  system  and 
examine  their  local  character  through  the  methods  of  §13.3  and  Appendix 
C,  suitably  extended.  However  the  characteristic  equation  is  now  a  cubic,  so 
that  there  are  three  eigenvalues,  although  the  general  principles,  with  regard 
to  stable  and  unstable  manifolds  according  to  the  sign  of  real  eigenvalues, 
still  apply. 

When  a,  (5  are  indeed  fixed  the  behaviour  of  the  system  (13.33)  in  the 
(x,  y,z)  phase  space  can  be  summarized  as  follows  (see  Problem  14): 

•  0  <  p  <  1:  The  origin  Pi  =  (0,0,0)  is  the  only  critical  point  of  the 
system,  and  it  is  a  point  attractor.  This  corresponds  physically  to 
the  case  of  no  motion  and  steady  heat  conduction  across  the  slab  (see 
Fig.  13.19(a)). 

•  1  <  p:  The  origin  Pi  is  now  unstable  and  two  new  critical  points 
appear, 


P2,P3  = 


(±y//3(p-l),±y/l3(p-l),  (p-1)) 


It  can  be  shown  that  when: 


*l<p<  pcrit 


a{a  +  /?  +  3) 
(a -P-l) 


in  the  practical  case  a  >  (3  +  1, 
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all  the  eigenvalues  in  the  analysis  of  Pi,Pz  are  negative  real  or 
have  negative  real  part,  so  that  P2,Pz  are  point  attractors.  This 
corresponds  physically  to  steady  convection  rolls  in  either  of  the 
two  senses  (see  Fig.  13.19(b)). 
*  Pcrit  <  P,  the  points  Pi,P2,Pz  are  all  unstable,  so  that  a  single 
point  attractor  does  not  exist.  In  fact  the  loss  of  asymptotic  stabil- 
ity of  Pi ,  Pz  can  be  shown  to  be  via  a  subcritical  Hopf  bifurcation. 


T„  '/////////////////////////////////////////; 

/\        /V  /\ 


Heat  flux 


r.  777777777777777777777777777777777777777777, 

(a)  No  motion 


Tc  V////////////////////////////////////////, 


V 


V 


v 


^  V7777777777777777777777777777777777777777/ 

(b)  Convective  rolls 


Fig.  13.19 

We  can  show  that  trajectories  are  attracted  towards  a  bounded  region 
containing  the  points  P\,Pi,P?,  and  there  should  be  an  attractor,  since 

everywhere,  which  means  that  the  phase-space  volume  elements  are  reduced 
progressively  in  volume  by  the  flow  x.  (See  §12.5  and  relation  (A. 24).) 

In  fact,  it  turns  out  that  there  is  a  value  p,  such  that  for  p  <  p  there 
is  a  highly  disordered  motion,  which  corresponds  physically  to  turbulent 
convection.  Trajectories  in  the  phase  space  tend  asymptotically  towards 
a  structure  which  is  multi- leaved  on  a  hierarchy  of  scales  (a  'fractal')  and 
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looped  around  the  unstable  critical  points  P2,P%-  This  is  called  a  chaotic 
or  strange  attractor,  on  account  of  its  peculiar  properties.  (See  for  example, 
Fig  13.20  in  which  a  =  10,  (3  =  §  and  p  =  28,  so  that  pcrit  =  24.74). 


Fig.  13.20  Lorenz  attractor  [Lanford,  (Lecture  Notes  in  Mathematics,  615,  114, 
Springer  1977)] 

For  the  particular  values  of  a,  f3  chosen  here  a  strange  attractor  of  this 
general  type  is  found  to  occur  when  p  >  p  =  24.06.  Evidently  in  a  (here) 
small  range  p  <  p  <  pcr;t  there  are  actually  three  attractors  —  two  points 
(steady  convection)  and  one  strange  (turbulent  convection).  In  practice 
each  of  these  will  have  its  own  basin  of  attraction  of  states  attracted  to 
it,  so  that  the  attractor  which  is  observed  to  occur  depends  on  the  initial 
state  of  the  system.  The  strange  attractor  has  zero  phase-space  volume  on 
account  of  (13.34)  above,  and  a  fractal  geometric  structure.  However,  states 
which  are  close  initially  are  stretched  out  along  the  attractor  exponentially 
with  time.  The  latter  property  is  characteristic  of  chaotic  systems  and 
there  is  more  discussion  of  this  property  in  the  following  section  (§13.7). 

The  Rikitake  two-disc  dynamo 

In  order  to  model  irregular  reversals  of  the  Earth's  magnetic  field  Rikitake 
put  forward  (in  Proceedings  of  the  Cambridge  Philosophical  Society,  54,  89- 
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105,  1958)  a  system  containing  two  identical  coupled  disc  dynamos  which 
does  mimic  this  type  of  behaviour.  (See  Fig.  13.21.) 


Identical  discs  driven  by  the 
same  constant  torque 


Fig.  13.21 

If  X\ ,  X2  are  scaled  currents  and  Y\ ,  Y2  are  scaled  rotation  rates  of  the 
discs,  which  are  driven  by  a  constant  external  torque,  the  current  in  each 
disc  feeds  the  coil  of  the  other.  Then  the  equations  may  be  written 

X1=YX2-nX1, 

X2  =  (Y-  A)Xi  -  fj,X2,  (13.35) 
Y  =  1  -  XXX2, 

where  Y  =  Y\  =  Y2  +  A  and  A,  ji  are  control  parameters,  with  \i  positive. 

For  this  system  there  are  two  critical  points  in  the  (X±,  X2,Y)  phase 
space,  at  (±fc,  ±l/fc,/xfc2),  where  A  =  /i(fc2  —  1/fc2)-  (See  Problem 
15.)  Although  the  three  eigenvalues  in  the  local  linear  approximation 
are  —  2/i,  ±i-y/fc2  +  1/fc2,  the  apparent  centre  behaviour  associated  with  the 
pure  imaginary  pair  of  eigenvalues  is  here  not  carried  through  for  the  full 
system.  The  critical  points  correspond  to  senses  of  North/South  magnetic 
polarity  and  the  irregular  reversals  which  are  associated  with  this  model 
(see  Fig.  13.22)  mimic  the  field  reversals  of  the  Earth's  magnetic  field  which 
are  known,  from  the  interpretation  of  geological  records,  to  have  occurred 
at  irregular  intervals  during  more  than  the  last  160  million  years. 

There  is,  of  course,  no  suggestion  that  the  two-disc  dynamo  provides 
other  than  an  interesting  analogy  for  features  of  the  magnetohydrodynamic 
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Fig.  13.22  Pro  jection  on  the  (X2,  Y)  plane  of  a  trajectory  in  phase  space  for  fi  =  1,  K  = 
2  [Cook  and  Roberts  (Proceedings  of  the  Cambridge  Philosophical  Society,  68,  547—569, 
1970)] 

motions  in  the  Earth's  core  —  it  is  a  'toy'  system. 
The  forced  damped  pendulum 

If  the  simple  pendulum  of  §2.1  and  §13.3  is  subjected  to  damping  and 
forcing  we  may  write  the  equation  of  motion  in  the  form 

mW  +  \6 +  mg smG  =  F cos  fit,  (13.36) 

where  m,  I,  A,  g,  F,  ft  are  control  parameters. 

This  equation  may  be  scaled  suitably,  so  that  we  study  the  equivalent 
third-order  autonomous  system 

e  =  u, 

ui  = —Au>  —  sinO  +  B  cos (f>,  (13.37) 
0  =  0, 

where  A,  B,  ft  are  control  parameters. 

The  nonlinearity  of  this  system,  through  the  sin  6,  cos  qb  terms,  makes 
for  a  very  rich  variety  of  behaviour  as  A,  B,  ft  are  changed. 
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The  variety  of  behaviours  instanced  in  this  whole  section  is  common 
for  third-  and  higher-order  continuous  systems,  but  systems  have  to  be 
'special'  in  order  to  be  effectively  reducible  to  lower  order  via,  for  example, 
constants  of  the  motion.  Systems  need  to  be  of  at  least  third  order  if  they 
are  to  exhibit  chaotic  behaviour. 


13.7    Sensitivity  to  Initial  Conditions  and  Predictability 

An  important  consequence  of  the  effective  order  of  a  dynamical  system  is 
in  the  way  that  the  resulting  dimension  of  the  phase  space  can  restrict  the 
evolution  along  trajectories  of  states  which  may  be  initially  close  together. 
In  the  line  and  plane,  typically,  states  which  are  initially  close,  whose  tra- 
jectories remain  in  a  bounded  region  and  do  not  intersect  one  another,  move 
apart  at  a  rate  which  is  at  most  essentially  uniform,  on  average. 

If  we  examine  a  dynamical  system  with  closed  trajectories  in  phase 
space,  we  can  examine  the  evolution  of  two  neighbouring  initial  states 

P=(x0,yo),       Q  =  (x0  +  Ax0,y0  +  Ay0). 

(See  Fig.  13.23.) 


Fig.  13.23 

Now,  if  the  system  is  the  simple  harmonic  oscillator  (see  §13.3)  then,  in 
this  special  case,  the  period  along  each  closed  orbit  is  the  same  and  it  can 
be  shown  that  the  ('Pythagorean')  distance  in  phase  space  between  P  and 
Q  remains  bounded  in  time.  This  behaviour  is  not  typical  and,  if  Fig.  13.23 
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is  taken  to  represent,  for  example,  libration  motions  of  a  simple  pendulum, 
it  should  be  evident  that  different  orbits  have  different  periods.  For  this, 
the  more  typical  situation,  the  distance  between  P  and  Q  increases  broadly 
linearly  with  time,  as  measured  along  the  orbit.  A  useful  analogy  is  to 
consider  runners  travelling  at  slightly  different  rates  in  neighbouring  lanes 
of  a  running  track  (see  Problems  f6,  f 7). 

In  three-  and  higher-dimensional  phase  spaces  the  neighbouring  trajec- 
tories can  diverge  exponentially  from  one  another  within  a  bounded  region, 
without  crossing  one  another.  They  can  do  this  by  wrapping  over  and  under 
one  another  in  a  complex  tangle,  which  may  still  have  a  layered  structure. 
(See  Fig.  13.20  and  Fig.  13.22.)  This  geometrical  behaviour  is  central  to 
the  'strangeness'  of  a  strange  attractor  to  which  we  referred  in  §13.6.  The 
states  which  are  initially  close  move  apart,  in  such  a  way  that  the  distance 
d  in  phase  space  between  them  increases  with  time,  on  average,  like  eA*, 
with  A  positive.  (See  Fig.  13.24.) 


The  parameter  A  in  the  exponent  is  called  a  Lyapunov  exponent.  When 
a  typical  small  volume  of  initial  states  in  phase  space  is  considered,  it  is 
evident  that  dynamical  evolution  will  result,  in  general,  in  stretching  and 
squeezing  of  this  volume.  There  will  be  a  number  of  principal  deforma- 
tions, which  is  equal  to  the  dimension  of  the  phase  space,  leading  to  a  set 
of  Lyapunov  exponents.  Each  of  the  positive  exponents  leads  to  a  corre- 
sponding strong  divergence  of  initial  states.  We  can  evidently  test  flows  in 
phase  space  for  this  divergence  property.  The  strong  divergence  is  called 
sensitivity  to  initial  conditions  and  it  is  the  prime  property  which  identifies 
the  irregular  behaviour  now  called  chaos. 


d—dtleM  on  average 


Fig.  13.24 
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The  effect  on  the  global  behaviour  of  dynamical  systems  produced  by 
sensitivity  to  initial  conditions  is  often  called  the  butterfly  effect,  which  was 
first  put  forward  by  Lorenz  in  an  address  to  the  American  Association  for 
the  Advancement  of  Science  in  December  1972  —  'Predictability:  Does  the 
flap  of  a  butterfly's  wings  in  Brazil  set  off  a  tornado  in  Texas?'  —  printed 
in  his  book  The  Essence  of  Chaos  referred  to  above  in  the  description  of  the 
Lorenz  system  (13.33).  The  point  he  was  making  is  that  chaotic  regimes  in 
weather  systems  can  render  predictions  of  them  utterly  useless,  except  on  a 
very  short  time  scale.  The  term  'butterfly  effect'  has  become  a  much  used 
metaphor  for  describing  chaos,  which  has  experienced  an  explosive  growth 
in  application. 

The  consequences  for  all  dynamical  systems  and  for  the  real  situations 
they  model  are  immense.  A  direct  attack  is  made  on  the  long-standing 
philosophical  idea  of  determinism  and  on  the  concept  of  a  'clockwork  uni- 
verse'. The  Laplacian  point  of  view  identifies  the  initial  state  of  a  system 
as  the  cause,  from  which  (if  it  is  known  precisely)  the  effects  may  be  cal- 
culated with  infinite  precision  indefinitely  far  into  the  future,  or  into  the 
past.  There  is,  in  this  world  view,  no  room  for  chance  or  free  will. 

The  sensitivity  to  initial  conditions,  which  is  typical  of  many,  if  not  most, 
systems  and  is  made  very  apparent  by  the  geometrical  approach  pioneered 
by  Poincare,  leads  to  a  loss  of  predictability .  This  comes  about  because  the 
inevitable  imprecision  in  the  initial  description  of  even  a  classical  system, 
however  small  this  may  be,  leads  later  to  a  vastly  greater  imprecision  in 
its  predicted  fate.  It  is  in  this  way  that  deterministic  systems,  where  the 
mechanisms  may  be  known  completely  and  precisely,  can  have  an  essentially 
stochastic  ('random')  output,  so  that  outcomes  can  be  predicted  only  as  a 
matter  of  probability. 

A  good  illustration  of  this  apparent  paradox  is  provided  by  the  Galton 
board  (see  Fig.  13.25)  where  small  spheres  of  lead  shot  fall  through  a  regular- 
array  of  scattering  pins. 

The  path  of  an  individual  piece  of  lead  shot,  which  falls  through  such 
an  array  of  pins,  is  impossible  to  predict,  because  of  the  sensitivity  of  the 
collisions  at  individual  pins.  However  the  typical  outcome  for  the  resulting 
distribution  of  shot  along  the  channels  at  the  base  is,  in  this  case,  very  like 
the  normal  probability  distribution.  In  this  way  predictability  of  the  fate 
of  an  individual  piece  of  lead  shot  gives  way  to  a  collective  predictability 
for  many  pieces  in  the  form  of  chance.  (See  Problem  18.) 
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Fig.  13.25 

13.8  Summary 

The  analysis  of  the  geometry  of  dynamical  systems  in  phase  space  is  a 
very  powerful  way  of  investigating  their  behaviour  for  all  possible  initial 
conditions.  The  dynamics  is  dependent  crucially  on  attractors  and  on  their 
stability. 

For  systems  which  are  dynamically  one-dimensional  or  two-dimensional 
the  only  attractors  are  critical  points  (equilibria)  in  the  phase  line  and  the 
phase  plane  and,  for  the  latter,  limit  cycles.  As  control  parameters  are 
varied,  the  character  of  critical  points  and  limit  cycles  may  change,  so  that 
there  is  a  corresponding  change  in  the  phase  portraits  and  in  the  behaviour 
of  the  system. 

In  systems  of  third  (or  higher)  order  additional  attractors  may  arise 
which  can  have  a  fractal  and  sensitive  structure  —  strange  (chaotic)  at- 
tractors. 

In  the  phase  plane  the  system  trajectories  are  constrained  such  that 
the  divergence  of  initial  states  takes  place  essentially  at  a  uniform  rate, 
at  most.  For  a  strange  attractor,  in  systems  of  third  or  higher  order,  the 
divergence  will  be  essentially  exponential  and  this  leads  to  a  breakdown  of 
predictability.  The  consequence  is  that  deterministic  systems  can  produce 
effectively  stochastic  output.  This  has  profound  implications  for  our  view 
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of  physical  and  other  dynamical  systems;  it  can  also  be  expected  to  affect 
the  relationship  between  classical  physics  and  its  quantum  counterpart. 


Problems 

1.  The  population  x(t)  of  fish  in  a  pond  is  a  function  of  time  t  and  when  the 
fish  are  removed  at  a  constant  rate  /  the  function  x(t)  satisfies  equation 
(13.6). 

(a)  Show  that  there  is  a  critical  fishing  rate  /  such  that  for  /  <  /  there  are 
two  equilibrium  fish  populations,  of  which  only  one  is  asymptotically 
stable. 

(b)  If  the  fish  population  is  initially  in  stable  equilibrium  (with  /  =  0 
and  fishing  commences  at  a  rate  /  >  /,  show  that  x  — >  0  and  in  a 
finite  time  to  (which  need  not  be  found  explicitly!). 

Why  would  it  not  be  prudent  to  fish  at  a  rate  /  which  is  only  just  less 
than  /? 

2.  For  the  pendulum  equation  0  +  {g/l)sai6  =  0  find  the  equations  of 
the  trajectories  in  the  phase  plane  and  sketch  the  phase  portrait  of  the 
system.  Show  that  on  the  separatrices 

9(t)  =  (2n  +  l)7r±4axctan[exp(a;t  +  a)], 

where  n  is  an  integer,  a  =  constant  and  u>  =  y/g/l.  [Hint:  Use  the 
substitution  u  =  tan  \  {0  —  (2n  +  1)tt}.] 

3.  The  relativistic  equivalent  of  the  simple  harmonic  oscillator  equation  for 
a  spring  with  constant  k  and  a  rest  mass  mo  attached  is 

—  (  —      °^    -  )  +  kx  =  0       with       x  =  y, 

where  c  is  the  speed  of  light.  Show  that  the  phase  trajectories  are  given 
by 

rnoc2 /\Jl  —  y2/c2  +  7}kx2  =  constant 

and  sketch  the  phase  portrait  for  this  system. 

4.  Draw  the  phase  portrait  of  the  damped  linear  oscillator,  whose  displace- 
ment x(t)  satisfies  x  +  fix  +  uj^x  =  0,  in  the  phase  plane  (x,y),  where 
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y  =  x.  Distinguish  the  cases 


(a)  under-  (or  light)  damping 

(b)  over-damping 

(c)  critical  damping 


0  <  /i  <  2u>o, 
/i  >  2uj0, 


5.  Consider  the  gradient  system  (13.17)  in  the  case  U(x,  y)  =  x2  (x—1)2 +y2 . 
Find  the  critical  points  and  their  character.  Sketch  the  phase  portrait 
for  the  system. 

6.  For  the  Lotka-Volterra  system  (13.18)  show  that  the  trajectories  in  the 
phase  plane  are  given  by  f(x,y)  =  constant  as  in  (13.20).  In  the  first 
quadrant  x  >  0,  y  >  0,  the  intersections  of  a  line  y  =  constant  with  a 
trajectory  are  given  by  —clnx  +  dx  =  constant.  Hence  show  that  there 
are  0,  1  or  2  such  intersections,  so  that  the  equilibrium  point  (c/d,a/b) 
fore  this  system  is  a  true  centre  (i.e.  it  cannot  be  a  spiral  point).  Using 
the  substitution  x  =  ep,y  =  e9,  show  that  the  system  takes  on  the 
Hamiltonian  canonical  form  (13.22). 

7.  Consider  the  'competing  species'  system  (13.23).  For  the  case  k\  = 
cj\  =  a.\  =  l,fc2  =  |,o-2  =  i,a2  =  |  find  the  critical  points  and  their 
character  in  order  to  confirm  all  the  features  of  Fig.  13.9.  What  happens 
in  the  case  k\  =  1,  G\  =  2,  cti  =  |,  hi  =  3,  <t2  =  2,  a2  =4? 

8.  For  the  Arms-Race  model  system  (13.26)  with  all  parameters  positive 
show  that  there  is  an  asymptotically  stable  coexistence  or  a  runaway 
escalation  according  as  CiC2  >  aia2  or  Cic2  <  dia2. 

9.  *A  simple  model  for  the  dynamics  of  malaria  due  to  Ross  (1911)  and 
Macdonald  (1952)  is 


where: 

x,y  are  the  infected  proportions  of  the  human  host,  female  mosquito 
populations, 

N,  M  are  the  numerical  sizes  of  the  human,  female  mosquito  populations, 

a  is  the  biting  rate  by  a  single  mosquito, 

b  is  the  proportion  of  infected  bites  that  result  in  infection, 

r,  ji  are  per  capita  rates  of  recovery,  mortality  for  humans,  mosquitoes, 

respectively. 

Show  that  the  disease  can  maintain  itself  within  these  populations  or 


i  =  2/(1  "  x)  ~  rx, 

y  =  ax(l  -y)-  jiy, 
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must  die  out  according  as 

_     Ma2b  i 

R  =  —        >  1  or  <  1. 

N  fir 

10.  Show  that  Rayleigh's  equation  in  the  form  (13.28)  has  a  single  critical 
point  at  (0, 0)  and  that  this  is  always  unstable.  Making  use  of  substitu- 
tion x  =  v/y/3  show  that  v  satisfies  the  Van  der  Pol  equation  (13.29). 

11.  Show  that  the  origin  is  the  only  critical  point  for  the  system 

x  =  —y  +  ax(P  —  x2  —  y2), 
y  =  x  +  ay((3  -  x2  -  y2), 

where  a,  j3  are  real  parameters,  with  a  fixed  and  positive  and  (3  allowed 
to  take  different  values.  Show  that  the  character  of  the  critical  point 
and  the  existence  of  a  limit  cycle  depend  on  the  parameter  [3,  so  that 
the  system  undergoes  a  supercritical  Hopf  bifurcation  at  (3  =  0.  (Hint: 
Make  the  change  from  Cartesian  co-ordinates  to  plane  polars.) 

12.  *For  the  rotation  of  a  rigid  body  about  its  centre  of  mass  with  zero 
torque  the  equations  for  the  angular  momentum  components  Ji,  J2,  J3 
are  given  by  (13.31). 

(a)  Show  that  J\  +  J|  +  J|  =  J2  (constant),  so  that  the  angular  mo- 
mentum J  must  lie  on  a  sphere  in  ( J1;  3 2,  J3)  phase  space. 

(b)  When  I\  <  I2  <  h  show  that  there  are  six  critical  points  on  this 
phase  sphere  and  show  that,  in  local  expansion,  four  of  these  are 
centres  and  two  are  saddles.  (Hence  the  tennis  racquet  theorem  of 
§13.6.) 

(c)  Show  that  when  h  =  I2  ^  I3  then  J3  is  constant  (=  I3Q)  and  that 
Ji,  J2  are  simple  harmonic  (with  frequency  [I/3  —  ii|//i]fi). 

(d)  A  space  station  with  ii  <  I2  <  I3  is  executing  a  tumbling  motion 
with  oji,u>2,uJ3  nonzero.  It  is  to  be  stabilized  by  reducing  w  to  0 
with  an  applied  torque  —  \/j,\u>,  with  [i  constant,  so  that  the  right- 
hand  side  of  (13.31)  becomes  —  |/i|o;i,  etc.  About  which  of  its  axes 
does  the  space  station  tend  to  be  spinning  as  w  — >  0? 

13.  *The  simple  SIR  model  equations  for  the  transmission  of  a  disease  are 

S  =  -aSI, 
I  =  aSI-  bl, 
R  =  bL 
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where  S(t),  I(t),  R(t)  are  respectively  susceptibles,  infectives,  re- 
moved/recovered and  a,b  are  positive  constants. 

(a)  Show  that  the  overall  population  TV  =  S+I+R  remains  constant,  so 
that  we  may  consider  (S,  I)  in  a  projected  phase  plane.  Hence  show 
that  a  trajectory  with  initial  values  (So,  Jo)  has  equation  I(S)  = 
I0  +  S()-S+(b/a)\n(S/S0). 

(b)  Using  the  function  I(S)  show  that  an  epidemic  can  occur  only  if 
the  number  of  susceptibles  So  in  the  population  exceeds  the  thresh- 
old level  b/a  and  that  the  disease  stops  spreading  through  lack  of 
infectives  rather  than  through  lack  of  susceptibles. 

(c)  For  the  trajectory  which  corresponds  to  Sq  =  (b/a)  +  5,  Iq  =  e 
with  S,  e  small  and  positive,  show  that,  to  a  good  approximation, 
there  are  (b/a)  —  S  susceptibles  who  escape  infection  [the  Kermack- 
McKendrick  theorem  of  epidemiology  (1926/27)]. 

14.  *Consider  the  Lorenz  system  (13.33). 

(a)  Show  that  the  origin  P i  (0,  0,  0)  is  a  critical  point  and  that  its  sta- 
bility depends  on  eigenvalues  A  satisfying  the  cubic 

(A  +  f3){\2  +  (a  +  1)A  +  a(l  -  p)]  =  0. 

Hence  show  that  Pi  is  asymptotically  stable  only  when  0  <  p  <  1. 

(b)  Show  that  there  are  two  further  critical  points 

P2,P3  =  [±^f3(p-l),±^f3(p-l),  (p-l)], 

when  p  >  1,  and  that  their  stability  depends  on  eigenvalues  A  satis- 
fying the  cubic 

A3  +  X2(a  +  P  +  1)  +  \f3(a  +  p)  +  2<jf3(p  -  1)  =  0. 

(c)  Show  that  when  p  =  1  the  roots  of  the  cubic  in  (b)  are  0,  —  /?,  — (1+cr) 
and  that  in  order  for  the  roots  to  have  the  form  —  p,  ±iv  (with  p,  v 
real)  we  must  have 

a(a  +  f3  +  3)  n 
P  =  P-*=   {a-P-l)  >0- 

(d)  By  considering  how  the  roots  of  the  cubic  in  (b)  change  continuously 
with  p  (with  (a,  (3  kept  constant),  show  that  P2,Prs  are  asymptoti- 
cally stable  for  1  <  p  <  pcr-lt  and  unstable  for  p  >  pcrit. 
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(e)  Show  that  if  z  =  z  —  p  —  a,  then 


|d(a;2+2y2  +  2-2) 


ax2-y2-P[z+^p  +  a)]2  +  \P(p  +  a)2, 


so  that  (x2  +  y2  +  z2)1/2  decreases  for  all  states  outside  any  sphere 
which  contains  a  particular  ellipsoid  (implying  the  existence  of  an 
attractor). 

15.  *For  the  Rikitake  dynamo  system  (13.35): 

(a)  Show  that  there  are  two  real  critical  points  at  (±fc,  ±l/fc,  pk2)  in  the 
(X±,  X2,  Y)  phase  space,  where  k  is  given  by  A  =  p(k2  —  l/k2). 

(b)  Show  that  the  stability  of  these  critical  points  is  determined  by  eigen- 
values A  satisfying  the  cubic 


so  that  the  points  are  not  asymptotically  stable  in  this  approxima- 
tion. (For  the  full  system  they  are  actually  unstable.) 
(c)  Show  that  the  divergence  of  the  phase-space  flow  velocity  is  negative, 
so  that  the  flow  causes  volume  to  contract. 


and  use  this  result  to  determine  in  which  region  of  the  space  the 
trajectories  all  have  a  positive  inward  component  towards  X±  = 
X2  =  Y  =  0  on  surfaces  X\  +  X$  +  Y2  =  constant. 

16.  For  the  simple  harmonic  oscillator  system  (13.8)  we  may  solve  for  the 
equation  of  the  trajectory  which  passes  through  x(xo,  yo,  t),  y(xo,  yo,  t). 
If  we  now  consider  perturbations  Axo,Ayo  in  the  initial  data  (xo,yo) 
at  t  =  0,  find  the  resulting  changes  Ax,  Ay  in  x,  y  and  show  that 
they  remain  bounded  in  time,  so  that  the  Pythagorean  distance  d  = 
\J (Aa;)2  +  {Ay)2  then  remains  bounded. 

17.  *In  contrast  to  Problem  16,  consider  the  perfectly  elastic  bouncing  of  a 
ball  vertically  under  gravity  above  the  plane  x  =  0.  We  have  x  =  y,y  = 
—g  and  we  can  solve  for  x{xq,  yo,  t),  y(xo,  yo,  i)  in  terms  of  the  initial 
data  {xo,  yo)  at  t  =  0.  Show  that  in  this  case  the  resulting  perturbations 
Aa;,  Ay  essentially  grow  linearly  with  time  t  along  the  trajectory  when 
we  make  perturbations  Aaro,  Aj/q  in  the  initial  data.  That  is  to  say  the 


(d) 


Given  that  Y  =  y/2(Y  -  A/2),  show  that 


\±-{X2  +  X2  +  Y2)  =  -p(X2  +  X2)  + 
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distance  along  the  trajectory  d  =  yj (A:r)2  +  (At/)2  ~  nt  when  t  is  large 
and  k  is  a  suitable  constant. 
18.  For  the  Galton  board  of  Fig.  13.25  we  may  arrange  things  so  that  each 
piece  of  lead  shot  has  an  equal  chance  of  rebounding  just  to  the  left  or 
to  the  right  at  each  direct  encounter  with  a  scattering  pin  at  each  level. 
Show  that  the  probabilities  of  each  piece  of  shot  passing  between  the  pins 
along  a  particular  row  n  are  then  given  by  (")(|)n  where  the  binomial 
coefficient  (™)  =  n\/[{n  —  r)!r!]  and  r  =  0,1,..., n.  Use  the  result 
(™+i)  =  (")  +  (r+i)  to  generate  the  probability  distribution  for  row  n  = 
16.  (For  large  numbers  of  pieces  of  shot  and  large  n  the  distribution  of 
shot  in  the  collection  compartments  approximates  the  standard  normal 
error  curve  y  =  fcexp(— x2 /2s2)  where  k,  s  are  constants.) 


Chapter  14 

Order  and  Chaos  in 
Hamiltonian  Systems 

In  this  chapter  we  consider  the  geometrical  aspects  of  the  particular  class 
of  dynamical  systems  which  have  the  Hamiltonian  structure  described  in 
Chapter  12.  The  dynamics  of  these  systems  depends  crucially  on  the  num- 
ber of  symmetries  which  they  possess,  leading  to  conserved  quantities.  It 
will  become  apparent  that  the  larger  the  number  of  symmetries  the  greater 
is  the  restriction  in  the  freedom  of  a  trajectory  in  the  corresponding  phase 
space  and  the  more  ordered  is  the  motion.  In  contrast  a  deficit  in  the  num- 
ber of  such  symmetries  allows  the  development  of  irregularity,  or  chaos,  in 
the  evolution  of  states  along  the  system  trajectories. 

14.1  Integrability 

In  §12.6  we  discussed  the  direct  relation  between  symmetries  and  conserva- 
tion laws,  indicating  that  conserved  quantities  (constants  of  motion)  may 
be  used  to  reduce  progressively  the  effective  order  of  a  system.  If  there 
are  enough  symmetries  and  independent  conserved  quantities,  then  the  re- 
duction may  be  complete,  in  that  it  leads  to  the  full  solution  of  the  sys- 
tem of  equations  ('integrability').  For  a  system  with  n  degrees  of  freedom 
the  Hamiltonian  H  may  depend  on  the  co-ordinates  qi, . . . ,  qn,  momenta 
pi, . . .  ,pn  and  on  time  t.  While  there  is  no  unique  definition  of  integra- 
bility, it  turns  out  that,  for  the  complete  solution  to  be  obtainable  via  a 
sequence  of  integrations,  we  need  n  independent  functions  Fi  (i  =  1, . . . ,  n) 
of  the  same  variables,  which  are  themselves  constants  of  the  motion.  That 
is  to  say,  we  require  the  same  number  of  symmetries  as  there  are  degrees  of 
freedom  in  the  system  and  the  system  is  then  said  to  be  'integrable  (in  the 
sense  of  Liouville)'.  If  there  is  not  such  a  full  set  of  n  conserved  quantities 
then  the  system  is  said  to  be  non-integrable. 
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Recalling  (12.33), 

+  (14.1) 

We  will  confine  our  discussion  here  to  autonomous  systems,  for  which 
H  and  other  Fi  have  no  explicit  dependence  on  t,  and  we  then  have  n 
Poisson-bracket  relations  [Fi,H\  =  0  from  (14.1).  Of  course  H  itself  can 
be  taken  as  one  of  the  functions  Fi  and  the  invariance  of  H  corresponds  to 
symmetry  with  respect  to  time  translation. 

An  additional  technical  condition  required  of  the  Fi  is  that  for  each  pair 
of  them  we  require 

[Fi,Fj]=0       for       i,j  =  l,...n.  (14.2) 

The  functions  Fi  are  then  said  to  be  in  involution. 

Each  of  the  n  functions  Fi  =  constant  confines  a  particular  trajectory 
to  a  (2n  —  l)-dimensional  subspace  of  the  full  2n-dimensional  phase  space  of 
the  system.  So  the  complete  set  of  these  restrictions  confines  a  trajectory  to 
the  n-dimensional  intersection  M  of  these  subspaces.  That  this  is  a  strong 
dimensional  restriction  on  the  way  that  a  trajectory  can  move  in  the  full 
phase  space  is  indicated  by  Table  14.1. 

Table  14.1 

Number  of  degrees  of  freedom  1      2      3  n 

phase  space  2      4      6  2n 

Dimension  of:      surface  Fi  =  constant      1      3      5      2n  —  1 
M  1      2      3  n 


Naturally  the  values  of  co-ordinates  and  momenta  at  a  given  initial  time 
determine  the  n  values  of  the  Fi  constants  and  hence  pick  out  the  particular 
M  to  which  the  trajectory  having  these  initial  values  is  then  confined  for 
all  time. 

Now  at  each  point  on  a  particular  (2n  —  l)-dimensional  surface  Fi  = 
constant,  the  normal  to  the  surface  is  in  the  direction  given  by  the  2n- 
component  gradient  vector  (VgFj,  VpFj).  (See  §A.5.) 

The  Poisson-bracket  relation  (14.2)  is  a  scalar-product  relation  and  it 
expresses  the  orthogonality  conditions  between  members  of  a  set  of  n  vector 
fields  Vj  =  (VpFj,  —~VqFj),  j  =  1, . . .  ,n  and  the  above  gradient  vector.  It 
is  apparent  therefore  that  the  vector  fields  vj  at  a  particular  point  in  the 
phase  space  are  all  parallel  to  the  surface  M  which  contains  that  point.  The 
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fields  Vj  allow  us  to  define  a  co-ordinate  grid  continuously  over  the  surface 
M  and  without  any  singularity.  The  topology  of  M  is  then  necessarily 
that  of  a  torus  —  an  n-dimensional  'ring  doughnut'  embedded  within  the 
full  2n-dimensional  phase  space. 

For  a  system  of  one  degree  of  freedom  each  M  is  a  1-torus  F\  =  H  = 
constant,  i.e.  a  one  dimensional  loop  in  the  two-dimensional  phase  space. 
(See  Fig.  14.1(a).)  Evidently  v\  here  is  tangent  to  the  curve  at  each  point 
of  M.  For  different  values  of  F\  we  obtain  different  loops  M,  members  of 
a  set  of  'nested'  tori,  as  for  example  in  Fig.  13.4. 

For  a  system  of  two  degrees  of  freedom  each  M  is  a  2-torus,  i.e.  a 
two-dimensional  'ring  doughnut'  surface  embedded  in  the  four-dimensional 
phase  space.  (See  Fig.  14.1(b).)  In  this  case  we  can  take  wi,  i>2  everywhere 
parallel  to  distinct  loops  which  can  parametrize  the  surface.  It  is  the  case 
that  M  cannot  have,  for  example,  the  topology  of  a  sphere,  because  all 
Vi,V2  parametrizations  of  such  a  surface  are  necessarily  singular,  at  least 
somewhere.  In  Fig.  14.1(c)  the  attempt  to  use  a  grid  of  lines  of  latitude 
and  longitude  is  singular  at  the  poles.  That  all  such  attempts  for  a  sphere 
are  bound  to  be  singular  is  a  consequence  of  the  'hairy  ball  theorem'  of 
topology;  this  theorem  also  implies,  among  other  things,  the  existence  of 
at  least  one  crown  parting  in  the  combing  of  a  head  of  hair,  and  that  a 
spherical  magnetic  bottle  must  leak! 


(a)  (b)  (c) 


Fig.  14.1 

The  n-tori  for  an  integrable  system  of  n  degrees  of  freedom  in  its  2n- 
dimensional  phase  space  are  often  called  invariant  tori  because  a  trajectory 
(orbit)  which  is  initially  on  one  of  them  (M)  remains  on  it  for  ever.  Dif- 
ferent initial  conditions  then  lead  to  tori  which  are  nested  within  the  phase 
space. 
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On  a  particular  torus  M  the  separate  vector  fields  correspond  to  n  in- 
dependent types  of  linking  circuit  (see  Fig.  14. 1(a), (b)),  and  the  integrable 
motion  is  then  exactly  equivalent  to  the  combination  of  n  separate  corre- 
sponding periodic  motions,  one  for  each  type  of  linking  circuit.  Each  of 
these  periodic  motions  of  an  integrable  system  is  a  nonlinear  normal  mode, 
similar  to  those  detailed  in  §11.3,  but  now  with  an  analysis  which  is  ex- 
act —  no  approximation  has  been  made  for  small  oscillations.  This  makes 
integrability  a  very  special  property. 

The  separate  normal  modes  have  associated  natural  frequencies  a;,,  i  = 
1, . . . ,  n  on  a  particular  torus  M  and  any  trajectory  on  M  has  n  associated 
natural  periods  2n/u)i.  The  trajectory  is  then  said  to  be  multiply  periodic. 

If  the  trajectory  on  M  is  closed  (see  Fig.  14.2(a))  then  it  does  not  fill 
M  and  is  exactly  periodic.  For  closure  it  is  required  that  the  frequencies 
of  the  the  modes  are  rationally  related,  i.e.  uJi/ujj  is  rational  for  all 
This  degeneracy  is  the  exception  rather  than  the  usual  case,  so  that  closure 
is  a  special  property.  If  the  trajectory  on  M  is  not  closed,  because  of 
irrational  C0i/c0j,  then  typically  the  trajectory  covers  M  densely  eventually 
(see  Fig.  14.2(b))  —  we  have  quasiperiodicity  and  the  trajectory  is  said  to 
be  ergodic  on  M. 


(a)  Trajectory  closed  (b)  Trajectory  eventually  covers  the  surface 

densely 


Fig.  14.2 

Two  important  examples  are  given  by: 

Example:  Double  harmonic  oscillator  and  central  force  motion 

Find  suitable  independent  constants  F\ ,  F2  for 

(a)  the  double  harmonic  oscillator, 

(b)  central  force  motion. 
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Are  trajectories  closed? 

(a)  For  the  double  harmonic  oscillator  we  have 

H  =  ^(Pi+Pl)  +  hm(^i1i  +  ^l<&)  =  Hx  +  H2  (14.3) 

with  H\,H2  the  Hamiltonians  for  the  two  separate  oscillators.  Here  we 
may  take  F\,  F2  to  be  any  pair  of  H,  Hi,  H2,  since  then  [Ft,  H]  =  0.  The 
frequencies  are  Wi,u}2  and  this  evidently  leads  to  closed  trajectories  (pe- 
riodicity) only  when  OJ1/UJ2  is  rational,  leading  to  the  familiar  Lissajous 
figures. 

(b)  Central-force  motion  was  discussed  in  Chapter  4.  The  Hamilto- 
nian takes  the  form  (12.8)  and  we  can  take  Fi  =  H ,  F2  =  pe,  so  that 
[Fi,H]  =  0.  For  the  isotropic  harmonic  oscillator  of  §4.1  we  can  take 
uji  =  u>2  in  (14.3)  above,  so  that  all  trajectories  are  closed.  The  oscillator 
describes  an  ellipse  with  centre  at  the  origin  and  the  frequencies  of  radial 
and  transverse  motions  are  related  by  uir  =  2oj$.  For  the  inverse  square 
law  attractive  force  of  §4.3  all  bounded  trajectories  are  closed.  They  are 
ellipses  in  general,  with  the  centre  of  force  at  a  focus  and  uir  =  cog .  That 
the  isotropic  harmonic  oscillator  and  the  inverse  square  law  of  attraction 
are  the  only  such  force  laws  for  which  all  bounded  orbits  are  closed  is 
known  as  Bertrand's  theorem. 


14.2    Surfaces  of  Section 

If  we  consider  two  dynamical  states  which  are  initially  close  in  the  phase 
space  of  an  integrable  system,  they  do,  in  general,  have  different  values  of 
iq,i<2  and  so  they  reside  on  different  tori.  The  evolution  of  these  states 
along  their  respective  trajectories  takes  place  with  slightly  different  frequen- 
cies, so  that  the  distance  between  the  states  increases  essentially  linearly 
with  time  t.  So,  just  as  for  the  discussion  of  the  libration  motions  of  a 
simple  pendulum  in  §13.7,  there  is  for  an  integrable  system  no  exponential 
divergence  of  states  with  time  and  no  sensitivity  to  initial  conditions  —  the 
motion  is  ordered. 

When  a  full  complement  of  constants  of  the  motion  does  not  exist, 
so  that  we  have  a  system  which  is  not  integrable,  then  trajectories  have 
much  less  restriction  on  their  room  to  manoeuvre  and  sensitivity  to  initial 
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conditions  is  certain  in  at  least  some  regions  of  the  phase  space  —  there  is 
then  inevitably  chaotic  behaviour,  as  indicated  in  §13.7. 

A  method  of  detecting  these  ordered  and  chaotic  behaviours  was  put 
forward  by  Poincare  and  involves  the  concept  of  a  surface  of  section.  If 
we  consider  an  autonomous  system  with  two  degrees  of  freedom,  so  that 
we  have  a  four-dimensional  phase  space,  with  three-dimensional  surfaces 
of  constant  'energy'  (H  =  constant),  then  we  can  consider  taking  a  'slice' 
through  the  space,  and  this  may  or  may  not  be  planar  according  to  conve- 
nience. 

If  we  look  at  a  particular  trajectory  of  the  system  and  identify  the  points 
at  which  this  intersects  the  section  (in  the  same  sense)  then  the  dynamics 
induces  a  map  of  the  section  to  itself.  In  Fig.  14.3  we  have  H(qi,  q2,Pi,Pi)  = 
constant  and  we  have  chosen  a  section  qi  =  0  (with  p2  >  0)  to  demonstrate 
how  a  trajectory  maps  points  in  the  (qi,pi)  plane  successively. 

Pi 


Fig.  14.3 

When  we  map  different  {q\,p\)  points  using  different  trajectories  we 
develop  a  Poincare  return  map  of  the  section  to  itself. 

For  integrable  systems  we  may  take  a  section  through  the  nested  tori 
of  §14.1.  For  two  degrees  of  freedom  the  section  through  a  2-torus  (ring 
doughnut)  is  a  closed  curve  C  (see  Fig.  14.4)  and  a  particular  point  Pq  on 
this  curve  is  mapped  successively  by  the  trajectory  through  it  to  Pi,  then 
P2,  and  so  on.  If  the  trajectory  is  closed  (see  §14.1)  then  we  have  a  periodic 
motion  and  the  map  has  the  property  that  after  N  steps,  for  some  integer 
N,  it  carries  Pq  back  to  Po.  The  successive  points  P  are  twisted  around 
the  curve  C.  If  the  motion  is  only  quasiperiodic,  however,  there  is  no  such 
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(a)  Section  of  torus  (b)  Effective  twist  around  curve 

C  with  each  cycle 

Fig.  14.4 

exact  repetition  and  the  successive  points  P  eventually  fill  the  whole  curve 
C.  Integrable  systems  are  characterized  by  a  finite  set  of  discrete  points  or 
by  curve-filling  in  the  surface  of  section. 

For  systems  which  are  not  integrable  the  full  torus  structure  does  not 
exist.  For  two  degrees  of  freedom  we  expect  the  system  to  explore  the 
full  three-dimensional  'energy'  surface.  The  section  return  map  is  then 
generally  two-dimensional  and  is  not  confined  to  points  and  lines.  Systems 
which  are  non-integrable  are  characterized  by  area-filling  in  the  surface  of 
section. 

It  should  be  noted  that  the  Poincare  return  map  for  a  Hamiltonian 
system  can  be  shown  to  be  area-preserving.  This  property  is  related  to 
Liouville's  theorem,  treated  in  §12.5,  although  it  is  not  derivable  from  it. 
Despite  the  equal-area  property,  the  map  induced  by  a  particular  dynamical 
system  typically  involves  stretching  and  squeezing  of  finite  and  infinitesi- 
mal area  elements,  associated  with,  respectively,  positive  and  negative  Lya- 
punov  exponents  (see  §13.7)  whose  effects  balance.  When  these  effects  are 
combined  with  a  folding  of  trajectories,  necessitated  by  confinement  to  a 
bounded  region  of  phase  space,  it  is  clear  that  the  divergence  of  neighbour- 
ing trajectories  leading  to  the  sensitivity  to  initial  conditions  of  chaos  can 
then  typically  occur. 

For  systems  which  are  not  Hamiltonian  the  concept  of  a  Poincare  return 
map  for  a  surface  of  section  is  still  useful.  When  the  dynamics  involves 
dissipation  the  equal-area  property  is  replaced  by  area  reduction  at  each 
iteration  of  the  map  —  this  would  be  the  case  e.g.  for  the  Lorenz  system  of 
§13.6  —  and  here  the  effect  of  a  negative  Lyapunov  exponent  overcomes  the 
effect  of  one  which  is  positive.  (There  is  some  reference  to  the  properties 
of  discrete  maps  in  Appendix  D.) 
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14.3    Action/Angle  Variables 


Quite  often  the  initial  choice  of  variables  qi ,  pi  (i  =  1 , . . . ,  n)  used  to  describe 
a  system  of  n  degrees  of  freedom  may  not  be  the  best. 

For  integrable  systems  there  is  a  natural  set  of  co-ordinates  and  mo- 
menta which  is  particularly  convenient  and  useful.  We  recall  that  these 
systems  have  n  distinct  constants  of  the  motion  (Fj  =  constant)  and  we 
can  transform  to  a  new  set  of  co-ordinates  <pi  and  momenta  U,  in  such  a 
way  that 


•  the  Hamiltonian  form  of  the  equations  of  motion  is  preserved, 

•  the  new  momenta  It  are  functions  of  the  Fi, 

•  the  new  co-ordinates  4>i  are  all  ignorable  (as  in  §12.3). 


For  a  suitable  choice  the  are  angle  variables  and  the  ij  are  the  corre- 
sponding canonically  conjugate  action  variables  (see  Table  14.2). 


Table  14.2 


Old  description  New  description 

(?i.Pi)  (i  =  l,--.,n)  {<j>j,Ij)  (j  =  1, . . .  ,n) 

H(qi,Pi)  K(Ij)  EE  H 

dH  ■  dK 

Pl  =  ~7T~  ij  =  ~  7TT  =  0 

oqi  Otpj 


It  is,  of  course,  apparent  that  the  new  momenta  Ij  are  all  constants  of 
the  motion  and  that  the  evolution  of  each  <pj  is  given  by  <j>j  =  u)jt+f3j,  where 
u>j ,  j3 j  are  constants.  Evidently  the  angle  variables  evolve  at  a  uniform  rate. 

Although  these  are  general  results,  let  us  consider  how  the  change  of 
variables  may  be  effected  for  a  system  of  one  degree  of  freedom  (n  =  1).  For 
such  an  autonomous  system  we  have  H  =  constant  ('energy')  closed  trajec- 
tories in  the  phase  plane,  representing  libration  motions.  (See  Fig.  14.5(a).) 
We  need  to  choose  the  action  /  to  be  a  suitable  function  of  H,  such  that 
the  phase  plane  in  the  new  description  is  as  given  in  Fig.  14.5(b). 

For  Hamilton's  equations  to  be  preserved  in  form  it  is  necessary  that 
the  change  of  variables  (q,p)  — >  (<f),  I)  preserves  areas,  so  that  a  suitable 
choice  for  /  is  derived  from  the  area  within  the  corresponding  trajectory  in 


Order  and  Chaos  in  Hamiltonian  Systems 


355 


Fig.  14.5 


the  (q,  p)  phase  plane  by 

I=^fpdq,  (14.4) 

as  indicated  by  the  equality  of  the  shaded  areas  ////  in  Fig.  14. 5(a), (b). 
Naturally  /  depends  directly  on  H,  and  the  natural  frequency  of  the  system 
for  the  corresponding  trajectory  is  then  given  by  (see  Table  14.2) 

"(/)  =  w  55  w  (14'5) 

It  also  follows  from  the  preservation  of  area  that  the  defining  relation  for  <j> 
is  given  by 


Q     l-q  d  fq 

diJo  pdq  =  udnJ0  pd<z'  (14'6) 


again  as  indicated  by  the  equality  of  the  shaded  areas  \\\\  in 
Fig.  14. 5(a), (b)  for  small  changes  AH,AI  in  H,  I  respectively.  The  'angle' 
nature  of  the  <fi  variable  can  be  emphasized  by  a  further  change  of  variables 
to  polars  (R,  <j>),  where  R  =  \p2J  in  order  to  maintain  the  preservation  of 
area  property.  (See  Fig.  14.5(c).) 

For  the  oscillator  problems  of  §2.1  and  §13.3  we  have 

2 

H(q,p)  =  ^-  +  V(q)=E,  (14.7) 


so  that  p  =  ± yj2m[H  —  V(q)]  and 


1  =  hfpdq  =     /'2  VH  ~ v{q)  dq-  (14-8) 
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Example:  Simple  harmonic  oscillator 

Find  action/angle  variables  for  the  simple  harmonic  oscillator. 
Here  V(q)  =  \kq2 ,  so  that 


/  =  H/ujq,     where    ujq  =  \/kJm, 


(14.9) 


since  §pdq  is  the  area  within  an  ellipse  with  semi-axes  yfWJk  and 
V2mH.  (See  Fig.  14.6.) 


Pi 

2m 

Area  of  ellipse  —  2ttI 
Fig.  14.6 

For  the  simple  harmonic  oscillator  we  have  action  =  en- 
ergy/frequency. Evidently 

8H 


and  action 


with  the  angle  variable 


u  =  —  =  uo  (here) 


p2  +  kmq2 


(14.10) 


(14.11) 


d    f  d    fq  i  

—  /   pdq=  —       V  2mw(1l  -  kmq2  dq 

)  fq  dq 

Jo   \J 2muiQl  ■ 


arcsin  yq 
uj0t  +  (3. 


kmq2 


21  J 


(14.12) 
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Of  course,  the  original  variables  q,p  can  be  found  in  terms  of  <f>,  I  in  the 
form 


q=\    sin^,       p  =  J 2muJoI  cos <b.  (14.13) 

V  muJo 

These  relations,  together  with  the  linear  evolution  of  <fi  with  time  give  the 
evolution  of  the  original  q,p  variables. 


The  elastic  bouncer 

For  a  perfectly  elastic  ball  bouncing  normally  between  two  walls  (see 
Fig.  14.7)  the  Hamiltonian  is  H  =  p2 /2m  =  E  =  \mv2 .  Here  the  ac- 
tion I  =  V2m~HL/  ty  so  that  uj  =  tv2 1 1 'rnL2  =  irv/L  and  the  period  of  this 
'oscillation'  is  r  =  2h/uj  =  2L/v  (of  course). 


(a)  Physical  space 


(b)  Phase  space 


Fig.  14.7 

The  results  are  hardly  very  surprising  for  these  two  cases,  but  the 
method  provides  a  powerful  means  of  obtaining  the  frequency  (and  so  the 
period)  of  a  general  nonlinear  libration  oscillation. 

The  method  may  be  extended  to  deal  with  rotations,  which  we  will  not 
consider  further  here,  and  to  deal  with  systems  which  have  two  or  more 
degrees  of  freedom.  So  we  have  the  following: 

Central  force  motion 

(See  Chapter  4.) 
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We  may  write  the  Hamiltonian  in  plane  polars  in  the  form 

with  constants  of  the  motion  F\  =  H,F2  =  p$.  We  can  then  calculate  the 
action  variables  as 


(14.15) 


Example:  Inverse  square  law 

For  the  inverse  square  attractive  force  law  V(r)  =  —  \k\/r  (see  §4.3 
and  Problem  9)  we  obtain 


Show  that  in  this  case  the  nonlinear  normal  mode  frequencies  are 
the  same,  so  that  the  orbits  are  closed. 

Evidently 

H=E=-i0w-  (14-I7» 

The  Hamiltonian  depends  only  on  the  action  variables,  and  the  corre- 
sponding angle  variables  are  ignorable,  by  design.  The  natural  frequen- 
cies for  this  system  are  then  given  by 

mk2  m  (    2E\3/2      2n  „  , 

^1=^2=  7T  ^rA3  =  m[  =— ,  14.18 

(h  +h)s      \k\  \    m  J  t 

where  r  is  the  period.  In  fact  this  is  just  Kepler's  third  law  of  plane- 
tary motion  (4.32),  since  the  semi-major  axis  a  of  the  elliptical  orbit  of 
negative  energy  E  is  given  by  a  =  —\k\/2E  (as  in  (4.30)). 
Note  that  important  results  for  this  system  have  been  obtained  without 
explicit  calculation  of  the  form  of  the  angle  variables: 


•  The  equality  of  the  natural  frequencies  of  the  radial  and  transverse 
modes  of  oscillation,  which  leads  to  orbital  closure.  (See  §14.1.) 
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•  The  relation  between  the  period  of  oscillation  and  the  energy  £  of  a 
bound  orbit. 

A  further  important  property  of  the  action  variables  will  be  indicated  later 
in  §14.5. 

14.4    Some  Hamiltonian  Systems  which  Exhibit  Chaos 

We  have  seen  in  Chapter  13  and  in  previous  sections  of  this  current  chap- 
ter that  a  system  must  have  at  least  two  degrees  of  freedom  in  order  to 
have  a  phase  space  of  large  enough  dimension  to  exhibit  any  chaotic  be- 
haviour. Again  the  system  must  also  then  have  fewer  symmetries  (leading 
to  conservation  laws)  than  are  required  to  give  integrability  and  order. 

Orbital  motion  —  n-body  problem 

In  §§14.1,  14.2  we  indicated  that  the  central  force  problem  of  Chapter 
4,  with  a  fixed  centre  of  force  and  two  degrees  of  freedom,  is  integrable, 
so  that  it  cannot  exhibit  the  property  of  sensitivity  to  initial  conditions 
characteristic  of  chaos. 

This  is  also  true  for  the  two-body  problem  in  the  centre-of-mass  frame. 
(See  §7.2.)  Less  intuitively  apparent  is  that  the  motion  of  a  particle  in  a 
plane  under  the  inverse  square  law  of  attraction  to  two  fixed  centres  may 
also  be  integrated. 

However  the  three-body  problem  is,  in  general,  not  integrable,  even 
when  it  is  restricted  (as  in  Chapter  10,  Problem  15);  this  problem,  together 
with  the  extension  to  more  bodies,  was  the  strong  motivation  for  Poincare 
in  his  general  considerations  for  dynamical  systems. 

The  three-body  problem  has  nine  degrees  of  freedom  and  so  gives  rise 
to  a  system  of  differential  equations  of  order  eighteen.  We  may  reduce  this 
to  order  twelve  by  making  use  of  the  conservation  of  linear  momentum,  i.e. 
by  going  to  centre-of-mass  co-ordinates.  By  making  use  of  conservation  of 
energy  and  conservation  of  angular  momentum  we  can  reduce  the  order 
to  eight.  By  further  device  and  restricting  our  consideration  to  motions 
which  are  planar,  the  order  can  be  reduced  to  four.  This  is  the  best  that 
can  be  done  in  general.  Even  after  all  these  reductions  the  problem  is 
still  extremely  complicated  and  has  kept  mathematicians  busy  for  several 
hundred  years. 

As  an  indication  we  can  consider  the  equations  of  the  planar  circular 
restricted  three-body  problem  (Chapter  10,  Problem  15).  In  the  frame  of 
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reference  rotating  with  the  primary  masses  M\,M2  about  their  centre  of 
mass  (see  Fig.  14.8),  the  equations  for  the  third  (small  mass  m)  body  can 


y 

M,        "i  O 

a. 

M2 

Fig.  14.8 


be  written  in  the  form 


with 


and 


x  —  2u>y  ■ 
y  +  2ujx  ■ 


dV 
dx 
dV 
dy  ' 


(14.19) 


1    2/   2^    2^       GMi  GM2 

\u)  (x  +y  )   -  - 


T2 


^(x  +  atf+y2, 


a2)2  +y2. 


This  system  can  be  expressed  in  Hamiltonian  form  with  two  degrees  of 
freedom  and  there  is  only  a  single  conserved  quantity,  i.e.  the  Hamiltonian 
itself,  so  that  \{x2  +  y1)  +  V  is  a  constant,  which  is  known  as  the  Jacobi 
integral.  While  there  are  some  stable  equilibria  for  this  system  (depending 
on  the  mass  ratio  M1/M2,  see  Chapter  12,  Problem  12)  there  are  also 
motions  which  are  highly  chaotic.  However,  the  motion  with  given  initial 
conditions  is  restricted  by  the  curves  of  zero  velocity  at  which  x  =  0  =  y. 
(See  Problem  12  of  this  Chapter.) 

Interest  in  the  restricted  problem  reflects  the  hierarchical  nature  of  our 
Solar  System  and  an  increasing  interest  in  space  missions.  The  primaries 
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are  variously  Sun-Jupiter  (Chapter  12,  Problem  13),  Earth-Moon  (Problem 
12)  and  so  on.  The  five  relative  equilibria  referred  to  in  Chapter  10,  Problem 
16  are  apparent  in  these  applications  —  as  sites  for  orbiting  observatories 
and  as  staging  points  for  more  extended  missions  by  NASA  and  ESA. 

The  Lagrangian  equilateral  triangle  rotating  in  its  plane  about  its  centre 
of  mass  is  actually  a  relative  equilibrium  for  the  three-body  problem  with 
general  masses. 

Example:  Three-body  problem 

Show  that  the  three-body  problem  with  three  general  masses 
"*iifi2,m3  has  a  relative  equilibrium  (with  the  masses  not  in  a 
straight  line)  if  and  only  if  the  triangle  formed  by  the  masses  is 
equilateral  and  find  the  rotation  rate  f2. 


Fig.  14.9 


Note  that  the  origin  in  this  figure  is  at  the  centre  of  mass.  It  is 
apparent  that  the  masses  remain  in  a  plane.  Then  we  have 

m\T\  +  m,2Ti  +  mzrz  =  0,  (14.20) 

together  with 

r12  r31 


;i(>2 
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and  two  similar  equations  for  7712,7713.  From  (14.20)  and  (14.21)  we 
obtain  a  relation  ar^  +  f3r-j  =  0  where  a,  (3  are  scalars.  Since  r2,  ^3  are 
by  hypothesis  not  parallel,  we  must  have  a  =  f3  =  0,  which  then  leads  to 
ri2  =  r-n  =  a  (say).  Of  course  the  similar  equations  give  r23  =  a  also, 
and  by  substitution  we  obtain  f22  =  G(mi  +  m2  +  m^)/di. 

Naturally,  stability  of  this  configuration  is  quite  another  matter  —  in 
general  it  is  surely  unstable  and,  typically,  the  system  will  separate. 

The  search  for  exact  periodic  solutions  of  the  three-body  problem  has 
resulted  in  an  astonishing  solution  for  the  case  of  three  equal  masses  —  the 
masses  follow  one  another  round  a  'figure  of  eight'  in  a  plane,  visiting  in 
turn  each  of  the  Euler  three-body  straight-line  equilibria  with  one  mass  at 
the  mid-point  of  the  line  joining  the  other  two  (see  Chenciner  and  Mont- 
gomery, Annals  of  Mathematics,  152,  881-901,  2000).  While  this  solution  is 
extremely  special,  it  is  exact  and  extensive  searches  have  uncovered  further 
exotic  n-body  solutions. 

In  general  the  vast  majority  of  n-body  systems  are  chaotic,  particularly 
on  account  of  close  two-body  encounters  in  their  evolution. 

Charged  particle  in  a  magnetic  field 

In  §5.5  as  an  example  of  rotating  frames  of  reference  we  considered  the  effect 
of  a  magnetic  field  Bona  charged  particle  moving  in  an  orbit  around  a 
fixed  charge.  We  assumed  there  that  the  magnetic  field  was  sufficiently 
weak  for  the  quadratic  term  in  B  to  be  neglected,  leading  to  the  Larmor 
precession  of  the  Kepler  elliptical  orbit  around  the  magnetic  field  vector. 
If  the  weak  magnetic  field  approximation  is  not  made,  then  the  equation 
of  motion  of  the  moving  charge  can  be  written  relative  to  the  frame  which 
rotates  about  the  magnetic  field  with  the  Larmor  frequency  in  terms  of  the 
Hamiltonian 

ff=fl-M+iNp2.  (14.22) 

Here  the  constant  B  field  defines  the  z  axis  and  p2  =  r2  —  z2,  so  that  the 
Kepler  inverse  square  law  force  of  attraction  towards  the  origin  (having  full 
rotational  symmetry  about  all  three  axes)  is  supplemented  by  a  Hooke  at- 
traction towards  the  B  field  axis  (having  rotational  symmetry  only  about 
this  axis).  As  a  result  the  Hamiltonian  system  is  not  integrable  —  it  is,  in 
effect,  a  'many-body'  Hamiltonian  and  exhibits  chaos.  This  broken  symme- 
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try  of  e.g.  the  classical  hydrogen  atom  is  complex,  but  very  important  in  its 
implications  for  the  quantum  mechanical  treatment  of  the  same  problem 
(see  e.g.  Friedrich  and  Wintgen,  Physics  Reports,  183,  37-79,  1989). 

Tops 

In  §10.3  and  in  §§12.3,12.4  we  indicated  the  Lagrangian  and  Hamiltonian 
descriptions  of  the  symmetric  top,  pivoted  at  a  point  on  its  axis  of  sym- 
metry. The  problem  has  three  degrees  of  freedom  and  there  are  three  in- 
dependent conserved  quantities  H,pv,p^,  so  that  the  system  is  integrable 
and  the  motions  detailed  are  regular  and  ordered. 

More  complicated  systems  abound  when  the  equations  of  rotational  mo- 
tion of  an  asymmetric  body  are  considered,  or  where  the  system  involves 
rolling  motion,  so  that  it  is  non-holonomic.  For  example,  various  asymmet- 
ric 'tops'  appear  to  exhibit  ordered  and  chaotic  motion  regimes  and  this 
is  common  for  systems  which  are  modelled  by  the  Euler  equations  of  §9.8 
with  general  torques  G. 

Billiard  systems 

After  Berry  (European  Journal  of  Physics,  2,  91-102,  1981)  we  can  consider 
the  motion  of  a  point  'ball'  moving  within  a  closed  boundary  (see  Fig.  14.10) 
and  bouncing  perfectly  on  the  wall.  We  assume  motion  in  a  straight  line 
between  collisions  with  the  wall,  simple  reflection  at  each  bounce  and  no 
dissipation,  i.e.  we  assume  that  the  Hamiltonian  is  H  =  p2/2rn  within 
the  enclosure.  The  ball  follows  a  path  just  like  that  of  a  light  ray  with  a 
boundary  wall  which  is  a  perfect  mirror. 


Fig.  14.10 
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Despite  its  apparent  simplicity,  this  system  turns  out  to  be  very  rich 
and  instructive,  with  dynamics  depending  particularly  on  the  shape  of  the 
enclosure. 

The  dynamics  can  be  examined  using  natural  canonical  variables,  s,p, 
where  s  is  the  distance  measured  along  the  boundary  and  p  =  cos  a,  where 
a  is  the  angle  of  incidence  (and  reflection)  at  the  wall.  (Note  that  p  here  is 
not  the  magnitude  of  the  linear  momentum,  \p\.)  This  leads  to  a  successive 
bounce  map  {sn,pn)  — >  (sn+i,pn_|_i)  which  preserves  area  and  leads  to  a 
Poincare  section  for  the  dynamics  —  the  slice  through  the  overall  dynamical 
space  is  the  (non-planar)  boundary  shape. 

For  motion  in  a  planar  region  of  this  kind  the  system  has  two  degrees 
of  freedom  and  H  is  a  constant  of  the  motion.  For  some  special  enclosure 
shapes  e.g.  circular  (see  Fig.  14.11),  elliptical  (see  Fig.  14.12)  there  is  in 
each  case  another  conserved  quantity  leading  to  integrability  and  order. 

Example:  Circular  billiard 


(a)  ^^^^^^^ 

(c)        ^y7^^                <d)  , 

•  • 

V/V                       ^-/-V       cos  ^1T^ 

I 

„  =  W?\\                   (  P\ 

) 

L 

<^  a=3ir/5 

Fig.  14.11  Billiards  in  a  circle:  (a)  basic  orbit  geometry;  (b)  typical  orbit  (never 
closing);  (c)  two  closed  orbits;  (d)  phase-space  trajectories  for  orbits  in  (b)  and  (c). 

Show  that  there  is  a  second  conserved  quantity  (independent  of  H) 
for  the  circular  billiard. 
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Evidently  the  billiard  trajectory  for  a  circular  enclosure  is  a  succes- 
sion of  chords,  each  at  the  same  perpendicular  distance  from  the  centre 
of  the  circle.  So  the  angular  momentum  of  the  ball  about  the  centre 
(=  aV2mH p,  where  a  is  the  radius  and  p  =  cos  a)  remains  constant 
through  successive  bounces  at  the  boundary.  The  angular  momentum 
is  the  second  conserved  quantity  and  is  evidently  independent  of  the 
Hamiltonian  H.  Note  that  when  a  chord  meets  the  circular  boundary 
the  angle  a  between  it  and  the  tangent  to  the  circle  is  the  same  at  each 
bounce  —  the  bounce  map  for  a  particular  trajectory  therefore  consists 
of  a  finite  number  of  points  or  a  line  (each  at  p  =  cos  a)  according  as  the 
trajectory  is  closed  or  is  not  closed. 

For  the  elliptical  enclosure  the  second  conserved  quantity  is  the  product 
of  the  angular  momenta  of  the  ball  measured  about  the  two  foci  of  the 
ellipse.  (See  Problem  10.) 


Fig.  14.12  Orbits  in  an  ellipse:  (a)  repeatedly  touching  a  confocal  ellipse;  (b)  repeatedly 
touching  confocal  hyperbolae;  (c)  ellipse  billiard  mapping. 
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In  these  cases  the  bounce  map  (Poincare  section)  is  made  up  of  points 
and  lines  characteristic  of  integrability.  When  the  shape  of  the  boundary 
is  not  so  special,  the  bounce  map  has  regions  in  which  there  is  area  filling, 
leading  to  chaos,  as  well  as  regions  where  ordered  motion  predominates. 
(See  Fig.  14.13.) 

Billiard  systems  with  various  different  properties  have  been  used  to 
model  other  types  of  physical  behaviour. 
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Fig.  14.13    Oval  billiard  mapping:  25  orbits  followed  through  200  bounces. 


Dice  and  coins 

The  motion  when  dice  and  coins  are  thrown  is  regular  when  there  are 
no  external  torques,  i.e.  when  moving  and  rotating  through  space,  with- 
out any  air  resistance.  However,  there  are  impulsive  forces  which  act  at 
each  bounce,  so  that  for  typical  shapes  the  motion  can  be  highly  chaotic. 
Whereas  billiards  (see  above)  involve  a  regular  object  bouncing  on  an  irreg- 
ular boundary,  dice  (and  coins)  are  a  sort  of  'dual'  in  that  they  are  irregular 
objects  bouncing  on  a  regular  surface.  For  each  of  these  types  of  system  the 
effects  of  small  changes  in  physical  geometry  can  be  large  in  the  statistical 
properties  of  the  outcomes  (see  §13.7),  whether  any  dissipation  which  may 
be  present  is  very  small  or  if  it  is  such  as  to  lead  to  rapid  energy  decay. 
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For  example,  when  a  regular  cubical  die  is  subject  to  a  small  change 
A  in  aspect  ratio  (away  from  1  for  the  perfect  cube),  each  1%  change  in 
this  geometric  measure  leads  to  a  3.4%  change,  approximately,  in  the  prob- 
abilities associated  with  the  faces  displayed,  a  change  modelled  locally  by 
A2is/2/7r(41/2/3  —  3)  (Berkshire,  British  Association  for  the  Advancement 
of  Science,  1985). 

The  swinging  Atwood's  machine 

A  simple  physical  system  which  exhibits  a  variety  of  dynamical  behaviours 
is  the  swinging  Atwood's  machine,  which  consists  of  two  masses  M,  m  con- 
nected by  a  light  inextensible  string  which  passes  over  a  small  pulley  and 
through  a  small  hole  in  a  smooth  vertical  plane,  so  that  the  mass  m  can 
rotate  in  this  plane.  The  string  is  such  that  collisions  of  masses  with  pulley 
or  hole  can  be  discounted.  (See  Fig.  14.14.) 


g 


Fig.  14.14 

This  system  was  introduced  first  by  Tufillaro,  Abbott  and  Griffiths 
(American  Journal  of  Physics,  52,  895-903,  1984)  and  has  been  the  subject 
of  various  subsequent  papers. 

The  Hamiltonian  for  the  system  is 

H  =  2m(l  +  ]T)  +  W  +  mgr^  ~  COSg)'    With    'i  =  M/m'  (1423) 

and  the  motion  is  not,  in  general,  integrable,  since  H  is  usually  the  only 
constant  of  the  motion.  In  the  case  /i  >  1  we  can  show  that  the  motion 
of  m  is  always  bounded  by  a  curve  of  zero  velocity  (p  =  0) ,  which  is  an 
ellipse  whose  shape  depends  on  the  mass  ratio  n  and  on  the  energy  H .  (See 
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Problem  11.)  When  /i  <  1  the  motion  is  not,  in  general,  bounded  for  any 
energy  and  eventually  the  mass  M  passes  over  the  pulley. 

The  system  is  integrable  in  the  case  /i  =  3  and  apparently  only  for  this 
particular  value  of  the  mass  ratio.  In  that  special  case,  there  is  a  second 
conserved  quantity  given  by 

pe  (      o    2Pe  .  e\       2  .  e    2  e        ,  A  nj, 

J  =  - —    pr  cos  sin  -    +  mgr  sm  -  cos  -.  (14.24) 

Am  \         2       r        2  J  2  2 

From  Hamilton's  equations,  r  =  Pr/Am,  pr  =  (p^/mr3)  —  mg(3  —  cosS), 
9  =  pe/mr2,  pg  =  —mgrsm.6,  one  can  show  by  brute  force  that  J  =  0.  We 
note  that  while  J  is  also  quadratic  in  the  momenta  pr,Ps,  it  is  independent 
of  the  Hamiltonian  H . 

It  is  certainly  not  yet  clear  what  'physical'  property  is  represented  by 
this  'hidden  symmetry'.  When  /i  =  3  the  motion  is  completely  ordered. 
For  all  other  mass  ratio  values  there  are  at  least  pockets  of  highly  irregular 
and  chaotic  motion. 


The  Henon-Heiles  system 

From  the  previous  examples  it  should  be  apparent  that  a  paradigm  of 
chaotic  dynamics  is  provided  by  a  Hamiltonian  system  with  two  degrees 
of  freedom  which  has  only  one  conserved  quantity,  e.g.  the  Hamiltonian 
itself.  Under  these  circumstances  the  system  dynamics  can  show  all  the 
features  of  sensitivity  to  initial  conditions  and  loss  of  predictability  indi- 
cated in  §13.7.  As  a  final  example  consider  the  motion  of  a  particle  of  unit 
mass  in  a  two-dimensional  asymmetric  potential  well  (see  Fig.  14.15)  so 
that  its  Hamiltonian  is 

H=^(Pl+p2y)  +  V(x,y),  (14.25) 

where 

V=^+y2)+X2y_lyZ 

—  \r'2  +  I7"3  sm3#       in  plane  polars. 

This  system,  with  the  analogy  of  a  small  ball  rolling  under  gravity  in 
an  asymmetric  bowl,  was  introduced  by  Henon  and  Heiles  (Astronomical 
Journal,  69,  73-79,  1964)  in  order  to  model  the  motion  of  a  star  in  a  galaxy 
which  has  a  simple  smoothed-out  gravitational  potential  to  represent  the 
attraction  of  the  other  stars. 
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Fig.  14.15    Henon-Heiles  potential  contours  of  V  near  the  local  minimum  at  (0,0). 

For  small  energies  the  cubic  terms  in  V  are  insignificant  when  compared 
with  the  quadratic  terms  and  in  that  limit  the  motion  is  integrable,  con- 
sisting as  it  does  of  a  combination  of  normal  modes  as  obtained  by  the 
methods  of  Chapter  11.  Of  course  the  motion  for  small  energies  is  just 
that  of  the  isotropic  oscillator  of  §4.1.  For  larger  energies  corresponding  to 
larger  displacements  away  from  the  potential  minimum  at  the  origin  the 
motion  becomes  highly  irregular. 

There  are  very  similar  consequences  for  many  other  such  systems,  e.g. 
the  double  pendulum  system  of  §11.1,  when  we  now  allow  9,  ip  not  to  be 
restricted  to  be  small. 

14.5    Slow  Change  of  Parameters  —  Adiabatic  Invariance 

Let  us  suppose  that  in  a  cyclic  system  (librating  or  rotating)  we  allow  some 
of  the  control  parameters  to  change  on  a  very  long  time  scale,  i.e.  on  a 
scale  of  many  basic  oscillation  periods.  Then  it  emerges  that  the  change 
in  an  action  /  is  then  also  very  slow,  in  the  sense  that  we  can  change 
the  parameters  by  as  large  an  amount  as  we  may  want,  while  /  stays  as 
near  constant  as  we  want,  provided  that  the  change  of  parameters  is  made 
slowly  enough.  The  change  of  parameters  is  then  said  to  be  carried  out 
adiabatically  and  the  action  variables  are  then  called  adiabatic  invariants. 

Now  a  general  proof  of  this  property  is  beyond  our  scope  here,  so  let  us 
consider  by  way  of  illustration  the  simple  harmonic  oscillator  of  §2.2,  but 
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now  with  a  frequency  which  is  slowly-varying.  That  is  to  say 

x  +  [u){t)]2x  =  0,  (14.26) 


with  u>  =  y/k/m,  where  lo/uj  «C  u>.  The  variation  in  uj  could,  for  example, 
be  brought  about  by  a  spring  constant  k  which  varies  on  a  time  scale  which 
is  very  long  compared  with  the  natural  period  r  =  2-r/uj.  Over  a  single 
period  (during  which  u>  does  not  change  appreciably)  we  may  write 

x(t)  =  acos[u)(t)t  -  6],  (14.27) 

as  in  (2.20). 

The  energy  of  the  oscillation  is 

E  =  \m(x2  +  io2x2)  =  \muj2a2,  (14.28) 

essentially,  and  this  now  changes  with  time  at  the  slow  rate 

E  =  m(xx  +  uj2xx  +  ujlox2)  =  muiuix2.  (14.29) 

By  averaging  over  a  period  of  the  oscillation,  so  that  e.g.  (E)  =  i  fQr  E  At  = 
E  (above)  we  obtain 


^(E)  =  mujuj{x2)  =  ij 


-(E). 


(14.30) 


We  obtain  immediately 

d_ 

dt 


(E) 


10 


so  that  {E)/uj  is  essentially  constant.  This  is,  of  course,  our  measure  of  the 
action  I  of  this  oscillator.  (See  (14.9).) 

For  more  general  systems  the  demonstration  is  carried  out  via  the  def- 
inition (14.4)  of  the  action  variable.  Particular  physical  applications  are 
very  common  and  valuable  conclusions  can  be  drawn  about  the  variations 
of  other  variables  on  the  assumption  of  the  applicability  of  the  principle  of 
adiabatic  invariance. 

Example:  Pendulum  with  slowly  varying  length 


Find  how  the  amplitude  of  (small)  oscillation  of  a  simple  pendulum 
changes  as  the  length  is  varied  slowly. 
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We  can  consider  the  simple  pendulum  of  §2.1,  but  now  with  length 
l(t)  changing  slowly  with  time,  and  we  can  simplify  considerably  by 
confining  ourselves  to  small  oscillations.  We  then  need  only  put  x  =  10 
in  the  above  analysis  of  the  simple  harmonic  oscillator,  together  with 


lem  13)  and  we  can  find  from  this  how  the  energy,  displacement,  maxi- 
mum acceleration  of  the  pendulum  bob  change  as  I  is  changed,  possibly 
by  orders  of  magnitude,  on  the  assumption  that  the  change  in  I  is  effected 
sufficiently  slowly. 

A  practical  problem  to  contemplate  when  considering  this  matter  is  the 
consumption  of  a  piece  of  spaghetti,  although  this  is  not  usually  a  simple 
pendulum  and  politeness  may  well  rule  out  the  necessary  long  time  scale 
for  consumption!  However,  the  general  principle  is  the  same,  particularly 
with  regard  to  the  dangers  of  spaghetti  sauce! 

The  elastic  bouncer 

In  §14.3  we  considered  a  perfectly  elastic  ball  bouncing  normally  between 
two  walls  and  the  action  /  for  that  case  was  found  to  be  \j2mH  L/n  where 
to,  H  are,  respectively,  the  mass  and  energy  of  the  ball  and  L  is  the  distance 
between  the  walls.  If  the  walls  are  now  moved  either  towards  or  away  from 
one  another  very  slowly  ('adiabatically'),  then  the  principle  of  adiabatic 
invariance  tells  us  that  L\f~H  should  be  essentially  constant,  so  that  v  oc  1/L 
and  energy  H  oc  1/L2  (see  Problem  14).  Again  it  should  be  noted  that  v,  H 
can  change  by  orders  of  magnitude,  while  the  action  /  remains  essentially 
constant.  We  can  note  here  that  we  can  model  a  monatomic  gas  with  elastic 
molecules  bouncing  in  a  cubical  box  and,  under  reasonable  assumptions, 
obtain  the  relationship  that  pressure  oc  (density)5/3  for  adiabatic  change. 

Attractive  central  force  problem  with  varying  strength 

Evidently  the  force  of  attraction,  e.g.  of  the  Sun,  would  diminish  as  its 
mass  decreases  over  what  is,  for  us,  a  long  time.  It  has  also  been  suggested 


I  =  E/uj  =  \mu>a2 

=  |TOVff^(^max)2, 


(14.32) 


since  a  =  Z0max  here. 

The  assumption  that  I  is  invariant  leads  to  9mal  oc  l~3/4  (see  Prob- 
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by  Dirac  that  the  gravitational  'constant'  may,  in  fact,  change  on  a  long 
time  scale.  In  either  case  we  can  look  to  the  action/angle  formulation 
in  §14.3,  where  I2  is  an  exact  invariant  and  I\  can  be  taken  now  as  an 
adiabatic  invariant.  As  a  consequence  we  can  show,  in  particular,  that,  as 
the  strength  \k\  of  the  attracting  force  decreases,  the  period  and  the  semi- 
major  axis  of  the  Kepler  elliptical  orbit  of  a  planet  both  increase  (as  we 
might  expect),  but  that  (rather  unexpectedly)  the  eccentricity  of  the  orbit 
remains  constant  and  the  orbit  retains  its  shape.  (See  Problem  16.) 

The  adiabatic  invariance  of  the  action  variables  has  enormous  impor- 
tance in  the  transition  to  quantum  mechanics.  The  pendulum  with  slow 
variation  in  length  was  a  discussion  problem  for  Lorentz  and  Einstein  in 
1911,  when  the  foundations  of  quantum  mechanics  were  being  laid.  The 
effective  constancy  of  the  action  variables  by  the  principle  of  adiabatic  in- 
variance, in  the  classical  formulation  as  control  parameters  are  changed  very 
slowly,  makes  the  action  variables  the  prime  candidates  for  quantization. 

14.6    Near-integrable  Systems 

When  we  have  a  system  which  is  Hamiltonian  with  H  =  Ho{q,p)  and 
integrable,  so  that  its  dynamics  are  regular  and  ordered,  we  can  ask  what 
happens  for  a  system  which  is  not  itself  integrable,  but  which  is  close  by. 
That  is  to  say  this  latter  system  has  Hamiltonian  H  =  H 0  +  eHi  where  e 
is  very  small. 

When  there  is  only  one  degree  of  freedom  (n  =  1)  there  is  a  well- 
developed  perturbation  theory,  which  allows  us  to  find  out  how  the  ac- 
tion/angle variables  for  the  Ho  system  are  changed  by  the  perturbation 
tH\.  We  won't  detail  this  here,  but  it  turns  out  that  the  only  real  diffi- 
culty with  the  perturbation  expansions  (as  power  series  in  e)  is  associated 
with  attempting  to  expand  near  where  the  basic  frequency  uiq  of  the  un- 
perturbed system  Hq  is  zero.  An  example  of  this  is  provided  by  the  simple 
pendulum  and  the  libration/rotation  orbits  which  are  situated  close  to  the 
separatrices  (at  which  ujq  =  0).  The  perturbation  expansions  break  down  in 
this  neighbourhood,  although  they  generally  work  well  elsewhere.  A  simple 
remedy  would  seem  to  suggest  itself  strongly  —  avoid  separatrices! 

For  systems  with  at  least  two  degrees  of  freedom  (n  >  2)  the  perturba- 
tion analysis  seems  to  be  plagued  by  a  similar  difficulty,  but  much  worse  in 
implication.  The  expansions  for  the  n  =  1  case  contain  denominators  u>q, 
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so  that  the  expansions  become  no n- uniformly  valid  near  ujq  —  0.  For  the 
n  >  2  case  we  find  in  the  expansions  denominators  like  o>0  ■  N  where  the 
lattice  vector  N  has  integer  components  and  ijJq  is  the  vector  of  normal 
frequencies  of  the  particular  torus  (see  §14.1)  about  which  we  are  trying  to 
expand  our  series  solution. 

Now  when  this  denominator  is  actually  zero  we  have  a  degeneracy  (or 
resonance),  in  that  some  of  the  normal  frequencies  have  rational  ratios. 
This  is  bad  enough,  but  the  trouble  is  that,  even  for  a>()  not  exactly  reso- 
nant in  this  manner,  we  can  apparently  make  choices  of  TV  components  in 
order  to  make  u>o  ■  N  as  close  as  we  might  (not)  like  to  zero  —  thus  de- 
stroying the  usefulness  of  our  expansion.  This  is  usually  called  the  problem 
of  small  denominators  and  it  seems  to  imply  that,  only  if  the  perturbation 
tH\  to  our  Hamiltonian  Hq  is  such  that  H  =  Ho  +  eH\  is  still  integrable 
through  the  preservation  of  a  sufficient  number  of  conserved  quantities,  is 
the  phase  space  torus  structure  itself  preserved,  although  distorted,  and 
with  convergent  perturbation  expansions. 

Very  elaborate  methods  have  been  developed  in  an  attempt  to  patch  up 
the  expansions  in  the  general  case.  Especially  noteworthy  is  a  tour  de  force 
by  Delaunay  for  the  Sun-Earth-Moon  system  ('lunar  theory')  undertaken 
in  the  middle  of  the  nineteenth  century,  which  involved  about  twenty  years 
of  endeavour,  but  all  such  efforts  seemed  doomed  to  failure.  It  seemed  that 
the  slightest  perturbation,  however  small,  would  cause  the  torus  structure 
of  an  integrable  system  to  collapse  irretrievably.  That  this  is  not  in  fact 
the  case  was  demonstrated  by  Kolmogorov,  Arnol'd  and  Moser  (KAM)  in 
work  originating  between  1954  and  1963.  This  work  has  led  to  an  explosion 
of  research,  results  and  applications.  KAM  showed  that,  as  long  as  the 
perturbation  introduced  to  a  smooth  system  is  itself  smooth  enough,  then 
the  torus  destruction  with  a  perturbed  system  is  progressive  as  the  size  of 
the  perturbation  increases. 

Key  roles  are  played  in  the  theory,  and  its  consequences,  by  the 
following: 

•  Rational  approximation.  This  means  the  closeness  by  which  real  num- 
bers can  be  approximated  by  rationals.  This  is  important,  because  the 
torus  breakdown  grows  with  the  perturbation  around  the  resonant  (u}q 
with  rational  ratios)  tori. 

•  Fractals.  These  are  'self-similar'  structures  in  maps,  i.e.  in  Poincare 
sections  (like  e.g.  Fig.  14.13)  and  in  full  phase  space.  Properties  of  maps 
are  explored  briefly  in  Appendix  D. 
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•  Divergence  of  trajectories.  Breakdown  of  the  torus  structure  gives  free- 
dom to  system  trajectories  to  diverge  strongly  from  their  neighbours, 
leading  to  'extreme  sensitivity  to  initial  conditions'  (see  §13.7)  and  to 
chaotic  behaviour. 

•  Lack  of  predictability.  We  have  effectively  stochastic  output  resulting 
even  from  strictly  deterministic  systems,  so  that  we  lose  knowledge  of 
the  connection  between  cause  and  effect. 

To  the  big  question  'Is  the  Solar  System  stable?',  which  has  motivated  a  lot 
of  work  in  Mechanics,  the  theory  provides  the  answer  'a  (definite)  maybe'. 
The  Solar  System  is  an  n-body  problem,  which  is  markedly  hierarchical,  so 
that  the  effect  of  e.g.  planet/planet  interactions  is  of  lower  order  than  the 
planet /Sun  interactions  which  produce  Kepler  ellipses.  For  this  reason  the 
broad  structure  of  the  system  is  robust  (fortunately  for  the  human  race!). 

For  large  configurations,  stable  and  unstable  regimes  may  well  be  closely 
interwoven,  so  that  in  an  idealized  system  we  might  not  be  able  to  tell  from 
(necessarily)  imprecise  observations  quite  what  regime  we  are  in!  However 
we  should  find  the  hierarchical  nature  of  our  system,  with  a  strong  central 
Sun,  to  be  reassuring  apart  from  the  possibility  of  asteroid  impact  —  in 
practice  the  fate  of  the  whole  Solar  System  is  actually  rather  likely  to  be 
determined  largely  by  dissipative  and  other  non-gravitational  forces. 

14.7  Summary 

In  this  chapter  the  importance  of  the  number  of  symmetries  of  a  Hamil- 
tonian  system  (leading  to  conservation  laws)  has  been  shown  in  the  re- 
strictions they  impose  on  the  freedom  of  the  system  trajectories  in  the 
appropriate  phase  space. 

Integrability  of  the  system  leads  to  a  natural  formulation  in  terms  of 
action/angle  variables,  which  allow  us  to  detail  the  nonlinear  normal  modes 
of  such  a  system,  i.  e.  modes  which  are  exact  and  involve  no  approximations 
for  small-amplitude  motions. 

Examples  have  been  given  of  simple/important  Hamiltonian  systems 
which  have  regimes  which  exhibit  chaos  and  a  resulting  breakdown  of  pre- 
dictability. 

The  action  variables  are  effectively  constant  when  system  control  pa- 
rameters are  changed  very  slowly.  This  principle  of  adiabatic  invariance  is 
a  general  result,  which  has  non-intuitive  and  useful  consequences  for  the 
way  in  which  other  quantities  of  physical  interest  may  change,  possibly  by 
orders  of  magnitude  on  these  long  time  scales. 
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The  adiabatic  invariance  of  action  variables  identifies  them  as  prime 
candidates  to  be  quantized  in  the  transition  from  classical  to  quantum 
mechanics. 

There  are  other  routes  to  chaos.  Indeed  e.g.  the  limit  cycles  of  Chapter 
13,  each  with  a  single-point  Poincare  section,  can  go  through  a  period- 
doubling  cascade  to  chaos  (see  Appendix  D)  as  parameters  are  changed. 
When  trajectories  are  stretched,  squeezed  and  folded  then  chaos  and  'mix- 
ing' typically  result. 

It  is  fitting  that  this  book  should  end  with  the  mention  of  results  (KAM) 
which  point  the  way  to  a  whole  host  of  future  developments,  by  no  means 
the  least  of  which  is  the  question  of  how  to  quantize  a  system  which  is 
classically  chaotic! 

As  we  said  in  the  Preface,  Classical  Mechanics  is  a  very  old  subject. 
However  it  is  very  much  alive! 


Problems 

1.  A  particle  of  mass  m  is  projected  outward  radially  from  the  surface 
(q  =  R)  of  a  spherical  planet.  Show  that  the  Hamiltonian  is  given  by 
H  =  p2 /2m  —  \k\/q  (with  k  constant),  so  that  H  is  a  constant  of  the 
motion  (=  energy  E).  Sketch  the  phase  portrait  in  the  (q,p)  phase  plane 
for  q>  R,  distinguishing  between  trajectories  which  correspond  to  the 
particle  returning  and  not  returning  to  the  planet's  surface.  When  the 
particle  does  return  show  that  the  time  taken  to  do  this  is 


Evaluate  this  integral  to  find  to  in  terms  of  h,R,  \k\/m.  (Hint:  the 
substitution  q  =  /isin2  9  is  helpful!)  By  considering  the  limit  R/h  — >  0 
show  that  the  result  is  in  accord  with  Kepler's  third  law  (4.32). 
2.  For  an  autonomous  Hamiltonian  system  with  two  degrees  of  freedom 
the  Hamiltonian  if  is  a  constant  of  the  motion  and  such  a  system  is 
integrable  if  another  constant  F  (independent  of  H )  can  be  found  — 
i.e.  such  that  [F,  H]  =  0. 

(a)  For  the  double  harmonic  oscillator  (14.3)  show  that  we  may  take 
F  =  p\/2m  +  \mio\q\  as  the  second  constant. 

(b)  For  central  force  motion  (14.14)  show  that  we  may  take  F  =  pg  as 
the  second  constant. 


where    h  =  r-=r. 
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(c)  *When  the  potential  V(r)  =  —\k\/r,  show  that  the  components  of 
the  vector 

p  A  J  r 

m\k\  r 

with  p,  J  respectively  linear,  angular  momentum)  are  constants  of  the 
motion. 

(A  is  the  Laplace-Runge-Lenz  vector.  It  can  be  shown  using  the  results 
in  §4.4  that  it  has  magnitude  given  by  the  eccentricity  e  of  the  conic- 
section  orbit  and  is  directed  from  the  centre  of  force  along  the  major 
axis  towards  the  position  of  closest  approach  ('perihelion').) 

3.  A  string  of  length  I  with  a  mass  m  at  each  end  passes  through  a  hole 
in  a  frictionless  horizontal  plane.  One  mass  moves  horizontally  on  the 
plane  and  the  other  mass  hangs  vertically  downwards.  Show  that  a 
suitable  Hamiltonian  for  the  system  is 

H  =T~  +  A-mg(l-r), 
4m  imrz 

where  (r,  6)  are  the  polar  co-ordinates  of  the  particle  on  the  plane  and 
pr,Pe  are  the  corresponding  momenta.  Identify  two  constants  of  the 
motion.  Show  that  a  steady  motion  with  r  =  ro  is  possible  (for  any 
ro  >  0) ,  if  pe  is  chosen  suitably,  and  that  the  period  of  small  oscillations 
about  this  motion  is  27r^/2ro/3<?. 

4.  Consider  again  the  particle  sliding  on  the  inside  of  a  smooth  cone,  as 
in  Chapter  12,  Problem  1.  Show  that  when  a  stable  circular  motion  is 
disturbed  the  resulting  small  oscillations  are  only  closed  if  v^sina  is 
a  rational  number. 

5.  For  central  force  motion  with  an  inverse  square  law  force  of  attraction 
the  Hamiltonian  is  (14.14),  i.e. 

H=f  +  -^-^(=  energy  E). 
2m     2mrA  r 

If  we  fix  pe,  show  that  r  is  bounded  (n  <  r  <  r^)  only  when 
—k2m/2pg  <  E  <  0  and  that  the  energy  minimum  corresponds  to 
an  orbit  in  physical  space  which  is  a  circle.  Sketch  the  curves  H  =  con- 
stant in  the  (r,pr)  projection  of  the  full  four-dimensional  phase  space 
for  this  system.  Consider  this  projection  in  the  light  of  the  discussion 
of  surfaces  of  section  in  §14.2. 

6.  A  particle  of  mass  m  is  constrained  to  move  under  the  action  of  gravity 
in  the  vertical  (x,  z)  plane  on  a  smooth  cycloid  curve  given  paramet- 
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rically  by  x  =  1(9  +  sin#),  z  =  1(1  —  cos  8).  Show  that  a  suitable 
Hamiltonian  is 

H  =   -  +  mgl(l-cos9). 

Ami1  (I  +  cos  9) 

Use  action/angle  variables  to  show  that  the  frequency  of  oscillation  of 
the  particle  is  independent  of  its  amplitude,  i.e.  it  is  the  same  for  all 
initial  conditions  with  \0\  <  it. 

(The  substitution  s  =  sin  ^9  is  useful.  This  tautochrone  property  of 
the  cycloid  was  known  to  Huygens  in  the  seventeenth  century  and,  in 
principle  at  least,  it  leads  to  some  quite  accurate  clock  mechanisms. 
Contrast  the  tautochrone  property  with  the  brachistochrone  property 
of  Chapter  3,  Problem  15.) 

7.  *A  particle  of  mass  m  is  attached  to  the  origin  by  a  light  elastic  string 
of  natural  length  I,  so  that  it  is  able  to  move  freely  along  the  x-axis 
if  its  distance  from  the  origin  is  less  than  I,  but  otherwise  moves  in  a 
potential  V(x)  =  ^k(\x\  —  I)2  for  |a;|  >  I.  If  the  particle  always  moves 
in  a  straight  line,  sketch  the  potential  and  the  phase-plane  trajectories 
for  different  values  of  the  energy  E.  Show  that  E  =  (<JKI  +  ~(P  -  P)2, 
where  I  is  the  action,  (5  =  \p2klj-K,  Q  =  ^Jk/m.  Explain  briefly 
(without  detailed  calculation)  how  the  angle  variable  <j>  conjugate  to 
the  action  may  be  found  in  the  form  (j>(x)  and  how  x  may  be  found  as  a 
function  of  the  time  t.  What  happens  in  the  (separate)  limits  of  small 
and  large  energies  El 

8.  *A  particle  of  mass  m  moves  in  a  one- dimensional  potential  V(q)  = 
i;(kq2  +  X/q2),  where  k,X,q  >  0.  Sketch  the  potential  and  the  phase 
portrait.  Show  that  the  energy  E  and  action  /  are  related  by 

E  =  VkX  +  2I^/k/m 

and  that  the  period  is  then  independent  of  amplitude.  Discuss  how  the 
dependence  of  q  on  the  angle  variable  (f>  may  be  found  and  then  the 
dependence  of  q  on  the  time  t. 

(This  one-dimensional  problem  models  the  purely  radial  part  of  the 
motion  of  the  isotropic  harmonic  oscillator  of  §4.1.  The  integral 


/" 


■  Ax 


sj (x2  -  x)(x  -  Xi)  —  =  §7r(xi  +  x2)  -  H^XiX2 


where  x2  >  x\  >  0  and  x  =  q2  will  prove  useful!) 
9.  *For  central  force  motion  with  an  inverse  square  law  force  of  attraction 
(see  Problem  5)  confirm  the  result  of  (14.16)  that  the  radial  action  I\  = 
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—I2  +  \\k\\f^fn/{—E),  where  I2  =  Pe,  so  that  the  natural  frequencies 
of  radial  and  transverse  motions  of  the  system  are  the  same. 
(In  order  to  evaluate  the  integral  in  (14.15)  the  result  given  at  the  end 
of  Problem  8  will  prove  similarly  useful!) 

10.  For  the  billiard  in  an  elliptical  enclosure  (Fig.  14.12)  use  the  result  of 
Appendix  B,  Problem  2  to  show  that  the  product  A  of  the  angular  mo- 
menta of  the  ball  measured  about  the  two  foci  of  the  ellipse  is  preserved 
through  each  bounce,  so  that  it  is  conserved  and  therefore  a  constant 
of  the  motion. 

Using  elliptical  co-ordinates  (see  Chapter  3,  Problem  24),  show 
that  A  =  (cosh2  A  -  cos2  6>)_1(sinh2  Xpj  -  sm26p2x)  and  that  H  = 
[2mc2(cosh2  A  —  cos2  0)]~1(p\  +Pg).  Hence  show  that  the  reflected  tra- 
jectory from  the  boundary  x2 /a2  +  y2  jb2  =  1  is  necessarily  tangent 
(when  A  >  0)  to  the  ellipse  A  =  arcsinhi/ A/2rnc2H. 
[Closure  for  such  a  trajectory  implies  closure  for  all  trajectories  tangent 
to  the  same  inner  ellipse  —  an  example  of  a  general  result  due  to 
Poncelet  (1822).] 

11.  *Consider  the  motion  of  the  swinging  Atwood's  machine  (see  Fig.  14.14) 
for  various  different  values  of  the  mass  ratio  [i  =  M/m. 

(a)  Show  that  when  fi  >  1  the  motion  of  m  is  always  bounded  by  a 
zero-velocity  curve  which  is  an  ellipse,  whose  shape  depends  on  /1 
and  on  the  constant  energy  E. 

(b)  Show  that  when  /1  <  1  the  motion  is  not  in  general  bounded  for 
any  E  by  sketching  the  zero-velocity  curves  for  mass  m  in  the  r,  9 
plane  (in  various  cases) . 

(Hint:  The  results  of  §B.2  will  prove  helpful  in  identifying  the  zero- 
velocity  curves.) 

12.  *Show  that  the  equations  of  motion  (14.19)  for  the  restricted  three- 
body  problem  may  be  put  into  Hamiltonian  form  using  the  substitution 
px  =  x  —  coy,  pv  =  y  +  lux.  Show  that,  since  the  system  is  autonomous, 
i(i2  +  y2)  +  V  =  C,  where  C  is  a  constant  of  the  motion  (Jacobi). 
Hence  sketch  the  regions  V  <  C  of  possible  motions  in  the  x,y  plane, 
for  various  values  of  C  and  of  the  mass  ratio  M1/M2,  considering  par- 
ticularly those  values  of  C  corresponding  to  the  three  'equilibria'  on 
the  x  axis  and  to  the  two  Lagrangian  equilateral  triangle  'equilibria'  — 
see  Chapter  10,  Problem  16.  (Note  that  for  the  Earth/Moon  system 
with  M1/M2  =  81.3  there  is  in  consequence  a  value  of  C  below  which 
an  Earth-Moon  transfer  is  not  possible.) 
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13.  For  the  small  oscillations  of  a  pendulum  whose  length  I  is  varying  very 
slowly  show  that  the  maximum  angular  displacement  #max  oc  Z-3/4. 
Hence  show  that  the  maximum  sideways  displacement  from  the  vertical 
is  oc  Z1/4  and  that  the  maximum  acceleration  is  oc  Z-3/4.  (Note  that  this 
last  result  implies,  as  I  decreases  slowly,  an  increasing  risk  for  spaghetti 
eaters  from  sauce  detachment!) 

14.  For  the  elastic  bouncer  (see  Fig.  14.7)  between  two  walls  whose  sepa- 
ration is  being  changed  slowly  ('adiabatically')  show  that  the  bouncer 
speed  v  oc  1/L.  We  can  model  (crudely)  a  monatomic  gas,  confined 
within  a  cubical  box  of  side  L,  by  assuming  that  each  molecule  has  the 
same  speed  v  and  that  one  third  of  the  molecules  move  normally  to  each 
pair  of  faces  of  the  box!  If  intermolecular  collisions  do  not  affect  these 
assumptions  then  L  can  be  changed  very  slowly.  Assuming  that  the  ki- 
netic energy  of  the  molecules  gives  a  measure  of  temperature  and  that 
the  gas  is  'ideal',  so  satisfying  the  ideal  gas  law:  (pressure)  x  (volume) 
oc  (temperature),  obtain  the  relationship  (pressure)  oc  (volume)5/3  for 
adiabatic  change  of  a  monatomic  gas. 

15.  A  particle  of  mass  m  moves  smoothly  up  and  down  a  smooth  inclined 
plane  (inclined  at  an  angle  a  to  the  horizontal) .  The  particle  Hamilto- 
nian is  H  =  p2 /2m  +  mgqsma  (=  E)  where  p  =  mq,  the  co-ordinate 
q  >  0,  being  measured  upwards  along  the  plane  from  a  fixed  point 
q  =  0  at  which  the  particle  is  perfectly  elastically  reflected  at  each 
impact.  Show  that  the  energy  E  and  action  /  for  this  oscillator  are 
related  by  E  =  [(9ir2/8)g2  msin2  a]1/3/2/3  and  find  the  frequency  u>  of 
small  oscillation  in  terms  of  g,  a  and  qo  (the  amplitude  of  the  motion). 
Given  that  the  angle  a  now  decreases  very  slowly  use  the  principle 
of  adiabatic  invariance  to  show  that  during  the  long  time  in  which  a 
decreases  from  ty/3  to  ir/6  the  energy  of  the  system  decreases  by  about 
31%  and  the  amplitude  and  period  increase  by  about  20%  and  44% 
respectively. 

16.  *For  central  force  motions  with  an  inverse  square  law  force  of  attrac- 
tion the  relation  between  energy  E  and  actions  I\,  Ii  is  (14.17),  demon- 
strated in  Problem  9.  If  the  strength  of  the  force  (i.e.  \k\)  decreases 
slowly  ('tired  sun'),  use  the  principle  of  adiabatic  invariance  to  show 
that  the  period  r  of  a  bounded  orbit  (an  ellipse,  see  Problem  5)  varies 
so  that  t  oc  k~2  (i.e.  t  increases).  Find  how  the  semi-major  axis  of 
the  ellipse  varies  with  k  and  show  that  the  eccentricity  of  the  ellipse 
remains  constant. 
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Vectors 


In  this  appendix,  we  give  a  summary  of  the  properties  of  vectors  which  are 
used  in  the  text. 


A.l     Definitions  and  Elementary  Properties 

A  vector  a  is  an  entity  specified  by  a  magnitude,  written  a  or  |a|,  and  a 
direction  in  space.  It  is  to  be  contrasted  with  a  scalar,  which  is  specified 
by  a  magnitude  alone.  The  vector  a  may  be  represented  geometrically  by 
an  arrow  of  length  a  drawn  from  any  point  in  the  appropriate  direction.  In 
particular,  the  position  of  a  point  P  with  respect  to  a  given  origin  O  may 
be  specified  by  the  position  vector  r  drawn  from  O  to  P  as  in  Fig.  A.l. 


k 


Fig.  A.l 
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Any  vector  can  be  specified,  with  respect  to  a  given  set  of  Cartesian 
axes,  by  three  components.  If  x,y,z  are  the  Cartesian  co-ordinates  of  P, 
with  O  as  origin,  then  we  write  r  =  (x,y,z),  and  say  that  x,y,z  are  the 
components  of  r.  (See  Fig.  A.l.)  We  often  speak  of  P  as  'the  point  r\ 
When  P  coincides  with  O,  its  position  vector  is  the  zero  vector  0  =  (0, 0,  0) 
of  length  0  and  indeterminate  direction.  For  a  general  vector,  we  write 
a  =  (ax,  ay,  az),  where  ax,  ay,  az  are  its  components. 

The  product  of  a  vector  a  and  a  scalar  c  is  ca  =  (cax,  cay,  caz).  If  c  >  0, 
it  is  a  vector  in  the  same  direction  as  a,  and  of  length  ca;  if  c  <  0,  it  is 
in  the  opposite  direction,  and  of  length  \c\a.  In  particular,  if  c  =  1/a,  we 
obtain  the  unit  vector  in  the  direction  of  o,  a  =  a/a. 

Addition  of  two  vectors  a  and  b  may  be  defined  geometrically  by  draw- 
ing one  vector  from  the  head  of  the  other,  as  in  Fig.  A. 2.  (This  is  the  'par- 
allelogram law'  for  addition  of  forces  —  or  vectors  in  general.)  Subtraction 


a 


t 


a 


Fig.  A.2 


is  defined  similarly  by  Fig.  A. 3.  In  terms  of  components, 


Fig.  A.3 


a  —  b 


a  +  b 


(ax  +  bx,ay  +  by,az  +  bz), 
(ax  -  bx,ay  -  by,az  -  bz). 
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It  is  often  useful  to  introduce  three  unit  vectors  i,j,k,  pointing  in  the 
directions  of  the  x-,  y-,  z-axes,  respectively.  They  form  what  is  known  as 
an  orthonormal  triad  —  a  set  of  three  mutually  perpendicular  vectors  of 
unit  length.  It  is  clear  from  Fig.  A.l  that  any  vector  r  can  be  written  as  a 
sum  of  three  vectors  along  the  three  axes, 

r  =  xi  +  yj  +  zk.  (A.l) 

Mathematically,  any  set  of  three  quantities  may  be  grouped  together  and 
regarded  as  the  components  of  a  vector.  It  is  important  to  realize,  however, 
that  when  we  say  that  some  physical  quantity  is  a  vector  we  mean  more 
than  just  that  it  needs  three  numbers  to  specify  it.  What  we  mean  is  that 
these  three  numbers  must  transform  in  the  correct  way  under  a  change  of 
axes. 

For  example,  consider  a  new  set  of  axes  i',j',k'  related  to  i,j,k  by  a 
rotation  through  an  angle  p>  about  the  z-axis  (see  fig.  A. 4): 


x  i 

Fig.  A. 4 
i'  =  i  cos  (p  +  j  sin  ip, 

j'  =  —  i  sin  ip  +  j  cos  ip,  (A. 2) 

k'  =  k. 

The  co-ordinates  x',y',  z'  of  P  with  respect  to  the  new  axes  are  defined  by 

r  =  x  i  +  y  j  +  z  k  . 

Substituting  (A. 2)  and  comparing  with  (A.l),  we  see  that  x  =  x'  cos  <p  — 
y's'mp,  etc,  or  equivalently  x'  =  xcosip  +  ys'mip,  etc.  Physically,  then,  a 
vector  a  is  an  object  represented  with  respect  to  any  set  of  axes  by  three 
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components  (ax,ay,az)  which  transform  under  rotations  in  the  same  way 
as  (x,y,z),  i.e.,  in  matrix  notation, 


cos  <p    sin  ip  0 

~ax~ 

a'y 

—  sin  tp  cos  ip  0 

ay 

a'z. 

0         0  1. 

.az_ 

A.2    The  Scalar  Product 

If  9  is  the  angle  between  the  vectors  a  and  b,  then  by  elementary  trigonom- 
etry the  length  of  their  sum  is  given  by 

\a  +  b\2  =  a2 +  b2 +  2abcos9.  (A.4) 

It  is  useful  to  define  their  scalar  product  a  ■  b  ('a  dot  b')  as 

a  ■  b  =  ab  cos  9.  (A. 5) 

Note  that  this  is  equal  to  the  length  of  a  multiplied  by  the  projection  of  b 
on  a,  or  vice  versa.  (See  Fig.  A. 5.) 


Fig.  A.5 

In  particular,  the  square  of  a  is 

2  2 

a  =  a  ■  a  =  a  . 

Thus  we  can  write  (A.4)  as 

(a  +  b)2  =  a2  +  b2  +  2a  •  b, 
and,  similarly,  the  square  of  the  difference  is 

(a  -  b)2  =  a2  +  b2  -  2a  •  b. 
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All  the  ordinary  rules  of  algebra  are  valid  for  the  sums  and  scalar  prod- 
ucts of  vectors,  save  one.  (For  example,  the  commutative  law  of  addition, 
a  +  b  =  b  +  a  is  obvious  from  Fig.  A. 2,  and  the  other  laws  can  be  de- 
duced from  appropriate  figures.)  The  one  exception  is  the  following:  for 
two  scalars,  ab  =  0  implies  that  either  a  =  0  or  b  =  0  (or,  of  course,  both), 
but  we  can  find  two  non-zero  vectors  a  and  b  for  which  a  ■  b  =  0.  In  fact, 
this  is  the  case  if  6  =  ir/2,  that  is,  if  the  vectors  are  orthogonal: 

a  ■  b  =  0       if       a  _L  b. 

The  scalar  products  of  the  unit  vectors  i,  j,  k  are 

i2  =  j2  =  k2  =  I,        ij=jk  =  ki  =  Q. 

Thus,  taking  the  scalar  product  of  each  in  turn  with  (A.l),  we  find 

i  ■  r  =  x,        j  ■  r  =  y,        k  ■  r  =  z. 

These  relations  express  the  fact  that  the  components  of  r  are  equal  to  its 
projections  on  the  three  co-ordinate  axes. 

More  generally,  if  we  take  the  scalar  product  of  two  vectors  a  and  b,  we 
find 

a  ■  b  =  axbx  +  ayby  +  azbz,  (A. 6) 

and  in  particular, 

r2  =  r2  =  x2  +y2  +  z2.  (A.7) 


A.3    The  Vector  Product 

Any  two  non-parallel  vectors  a  and  b  drawn  from  O  define  a  unique  axis 
through  O  perpendicular  to  the  plane  containing  a  and  b.  It  is  useful  to 
define  the  vector  product  aAb  ('a  cross  b\  sometimes  also  written  a  x  b)  to 
be  a  vector  along  this  axis  whose  magnitude  is  the  area  of  the  parallelogram 
with  edges  a  and  b, 

\a  A  b\  =  ab sinO.  (A. 8) 

(See  Fig.  A. 6.)  To  distinguish  between  the  two  opposite  directions  along 
the  axis,  we  introduce  a  convention:  the  direction  of  a  A  b  is  that  in  which 
a  right-hand  screw  would  move  when  turned  from  o  to  6. 
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i  i 

aAb 


Fig.  A.6 

A  vector  whose  sense  is  merely  conventional,  and  would  be  reversed  by 
changing  from  a  right-hand  to  a  left-hand  convention,  is  called  an  axial 
vector,  as  opposed  to  an  ordinary,  or  polar,  vector.  For  example,  velocity 
and  force  are  polar  vectors,  but  angular  velocity  is  an  axial  vector  (see 
§5.1).  The  vector  product  of  two  polar  vectors  is  thus  an  axial  vector. 

The  vector  product  has  one  very  important,  but  unfamiliar,  property. 
If  we  interchange  a  and  b,  we  reverse  the  sign  of  the  vector  product: 

bAa  =  -aAb.  (A. 9) 

It  is  essential  to  remember  this  fact  when  manipulating  any  expression 
involving  vector  products.  In  particular,  the  vector  product  of  a  vector 
with  itself  is  the  zero  vector, 

a  A  a  =  0. 

More  generally,  aAb  vanishes  if  6  =  0  or  it: 

a  A  b  =  0       if       a  ||  b. 

If  we  choose  our  co-ordinate  axes  to  be  right-handed,  then  the  vector 
products  of  i,j,k  are 


i  A  i 

=  J  A  j 

=  k  A  k  = 

o, 

iAj 

=  k, 

j  Ai  = 

-k 

j  A  k 

=  i, 

k  A  j  = 

-i, 

k  A  i 

=  3, 

i  A  k  = 

-3 
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Thus,  when  we  form  the  vector  product  of  two  arbitrary  vectors  a  and  b, 
we  obtain 

a  A  b  =  i(aybz  -  azby)  +  j(azbx  —  axbz)  +  k{axbv  —  aybx). 
This  relation  may  conveniently  be  expressed  in  the  form  of  a  determinant 


a  A  b 


i   j  k 

&x  Q"y 

bx  by  bz 


(A.ll) 


From  any  three  vectors  a,  b,  c,  we  can  form  the  scalar  triple  product 
(a  A  b)  ■  c.  Geometrically,  it  represents  the  volume  V  of  the  parallelepiped 
with  adjacent  edges  a,b,c  (see  Fig.  A. 7).  For,  if  ip  is  the  angle  between  c 


aAb 


Fig.  A.7 


and  a  A  b,  then 


(a  A  b)  ■  c  =  \a  A  b|ccos<^  =  Ah  =  V, 

where  A  is  the  area  of  the  base,  and  h  =  c  cos  ip  is  the  height.  The  volume 
is  reckoned  positive  if  a,  b,  c  form  a  right-handed  triad,  and  negative  if  they 
form  a  left-handed  triad.  For  example,  (i  A  j)  ■  k  =  1,  but  (i  A  k)  ■  j  =  —1. 

In  terms  of  components,  we  can  evaluate  the  scalar  triple  product  by 
taking  the  scalar  product  of  c  with  (A.ll).  We  find 


(a  A  b)  ■  c  : 


'-*Jt 

bx  b„  b2 


(A.12) 


Either  from  this  formula,  or  from  its  geometric  interpretation,  we  see 
that  the  scalar  triple  product  is  unchanged  by  any  cyclic  permutation  of 
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a,  b,  c,  but  changes  sign  if  any  pair  is  interchanged: 


(a  A  b)  ■  c  =  (b  A  c)  •  a  =  (c  A  a)  •  b 

=  -  (b  A  a)  •  c  =  -  (c  A  6)  •  a 


(a  A  c)  •  6. 


(A.13) 


Moreover,  we  may  interchange  the  dot  and  the  cross: 


(a  A  6)  •  c  =  a  •  (b  A  c). 


(A.14) 


(For  this  reason,  a  more  symmetrical  notation,  [a,  b,  c],  is  sometimes  used.) 

Note  that  the  scalar  triple  product  vanishes  if  any  two  vectors  are  equal, 
or  parallel.  More  generally,  it  vanishes  if  a,  b,  c  are  coplanar. 

From  three  vectors  we  can  also  form  the  vector  triple  product  (a  A  b)  A  c. 
Since  this  vector  is  perpendicular  to  a  A  6,  it  must  lie  in  the  plane  of  a  and 
b,  and  must  therefore  be  a  linear  combination  of  these  two  vectors.  It  is 
not  hard  to  show  by  writing  out  the  components,  that 


Note  that  these  vectors  are  unequal,  so  that  we  cannot  omit  the  brackets  in 
a  vector  triple  product.  It  is  useful  to  note  that  in  both  of  these  formulae 
the  term  with  the  positive  sign  is  the  middle  vector  times  the  scalar  product 
of  the  other  two. 

A. 4    Differentiation  and  Integration  of  Vectors 

We  are  often  concerned  with  vectors  which  are  functions  of  some  scalar 
parameter,  for  example  the  position  vector  of  a  particle  as  a  function  of 
time,  r(t).  The  vector  distance  travelled  by  the  particle  in  a  short  time 
interval  At  is 


(a  A  b)  A  c  =  (a  ■  c)b  -  (b  ■  c)a. 


(A.15) 


Similarly, 


a  A  (b  A  c)  =  (a  ■  c)b  -  (a  ■  b)c. 


(A.16) 


Ar  =  r(t  +  At)  -  r(t). 


(See  Fig.  A. 8.)  The  velocity,  or  derivative  of  r  with  respect  to  t,  is  defined 
just  as  for  scalars,  as  the  limit  of  a  ratio, 
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Fig.  A.8 


In  the  limit,  the  direction  of  this  vector  is  that  of  the  tangent  to  the  path 
of  the  particle,  and  its  magnitude  is  the  speed  in  the  usual  sense.  In  terms 
of  co-ordinates, 

r  =  (x,y,z). 

Derivatives  of  other  vectors  are  defined  similarly.  In  particular,  we  can 
differentiate  again  to  form  the  acceleration  vector  r  =  d2r/dt2. 

It  is  easy  to  show  that  all  the  usual  rules  for  differentiating  sums  and 
products  apply  also  to  vectors.  For  example, 

d  .       , .      da  db 
—  (a  A  o)  =  — —  Ao  +  aA  — , 

dV        '      dt  dt 

though  in  this  particular  case  one  must  be  careful  to  preserve  the  order  of 
the  two  factors,  because  of  the  antisymmetry  of  the  vector  product. 

Note  that  the  derivative  of  the  magnitude  of  r,  dr/dt,  is  not  the  same 
thing  as  the  magnitude  of  the  derivative,  |dr/dt|.  For  example,  if  the 
particle  is  moving  in  a  circle,  r  is  constant,  so  that  r  =  0,  but  clearly  |f|  is 
not  zero.  In  fact,  applying  the  rule  for  differentiating  a  scalar  product  to 


2rr  =  -^(r2)  =  ^-(r2)  =  2rr, 

dV    '      dV  ' 

which  may  also  be  written 

r  =  rr.  (A. 18) 


Tims  the  rate  of  change  of  the  distance  r  from  the  origin  is  equal  to  the 
radial  component  of  the  velocity  vector. 
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We  can  also  define  the  integral  of  a  vector.  If  v  =  dr/dt,  then  we  also 
write 


and  say  that  r  is  the  integral  of  v.  If  we  are  given  v(t)  as  a  function  of 
time,  and  the  initial  value  of  r,  r(to),  then  the  position  at  any  later  time 
is  given  by  the  definite  integral 


(A.19) 


This  is  equivalent  to  three  scalar  equations  for  the  components,  for  example, 
x(h)  =  x(t0)  +  /  vx(t)dt. 

J  to 

One  can  show,  exactly  as  for  scalars,  that  the  integral  in  (A.19)  may  be 
expressed  as  the  limit  of  a  sum. 


A. 5     Gradient,  Divergence  and  Curl 

There  are  many  quantities  in  physics  which  are  functions  of  position  in 
space;  for  example,  temperature,  gravitational  potential,  or  electric  field. 
Such  quantities  are  known  as  fields.  A  scalar  field  is  a  scalar  function 
4>(x,  y,  z)  of  position  in  space;  a  vector  field  is  a  vector  function  A(x,  y,  z). 
We  can  also  indicate  the  position  in  space  by  the  position  vector  r  and 
write  <p(r)  or  A(r). 

Now  let  us  consider  the  three  partial  derivatives  of  a  scalar  field, 
d(j)/dx,  d(j>/dy,  d<p/dz.  They  form  the  components  of  a  vector  field,  known 
as  the  gradient  of  0,  and  written  grad</>,  or  V0  ('del  <j>\  or  occasionally 
'nabla  </>').  To  show  that  they  really  are  the  components  of  a  vector,  we 
have  to  show  that  it  is  defined  in  a  manner  which  is  independent  of  the 
choice  of  axes.  We  note  that  if  r  and  r  +  dr  are  two  neighbouring  points, 
then  the  difference  between  the  values  of  0  at  these  points  is 

d<f>  =  Mr  +  dr)  -  Mr)  =  ^  dx  +  ^  dy  +  ^  dz  =  dr  ■  V<j>.  (A.20) 

ox         ay  oz 

Now,  if  the  distance  |dr|  is  fixed,  then  this  scalar  product  takes  on  its 
maximum  value  when  dr  is  in  the  direction  of  V0.  Hence  we  conclude 
that  the  direction  of  V0  is  the  direction  in  which  <f>  increases  most  rapidly. 
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Moreover,  its  magnitude  is  the  rate  of  increase  of  <j>  with  distance  in  this 
direction.  (This  is  the  reason  for  the  name  'gradient'.)  Clearly,  therefore, 
we  could  define  V0  by  these  properties,  which  are  independent  of  any 
choice  of  axes. 

We  are  often  interested  in  the  value  of  the  scalar  field  <f>  evaluated  at 
the  position  of  a  moving  particle,  <j)(r(t)).  From  (A. 20)  it  follows  that  the 
rate  of  change  of  0  is 


d4>(r(t)) 
dt 


r  •  V0. 


(A.21) 


The  symbol  V  may  be  regarded  as  a  vector  which  is  also  a  differential 
operator  (like  d/dx),  given  by 

V      '  ^       '  ^      k  ^ 

1  dx     ^  dy  dz' 

We  can  also  apply  it  to  a  vector  field  A.  The  divergence  of  A  is  defined  to 
be  the  scalar  field 

dAx  .  dAv  .  dAz 
dx 

and  the  curl  of  A  to  be  the  vector  field 


dWA  =  V-A=^  +  —^  +  —^,  (A.22 

dy  dz 


curl  A  =  V  A  A 


i    j  k 

Odd 
dx  dy  dz 
Ax  Ay  Az 


(A.23) 


This  latter  expression  is  an  abbreviation  for  the  expanded  form 


V  A  A 


dAz 
dy 


dz 


+  3 


dAx 
dz 


dA, 
dx 


+  k 


dAy  dAx 


dx 


dy 


(Instead  of  curl  A,  the  alternative  notation  rot  A  is  sometimes  used,  par- 
ticularly in  non-English-speaking  countries.) 

To  understand  the  physical  significance  of  these  operations,  it  is  helpful 
to  think  of  the  velocity  field  in  a  fluid:  v(r)  is  the  fluid  velocity  at  the  point 
r. 

Let  us  consider  a  small  volume  of  fluid,  5V  =  5x  5y  Sz,  and  try  to  find  its 
rate  of  change  as  it  moves  with  the  fluid.  Consider  first  the  length  Sx.  To  a 
first  approximation,  over  a  short  time  interval  dt,  the  velocity  components 
in  the  y  and  z  directions  are  irrelevant;  the  length  5x  changes  because 
the  x  components  of  velocity,  vx,  at  its  two  ends  are  slightly  different,  by 
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an  amount  (dvx/dx)5x.  Thus  in  a  time  dt,  the  change  in  Sx  is  dSx 
(dvx/dx)  Sxdt,  whence 

dSx  dvx 
dt  dx 

Taking  account  of  similar  changes  in  5y  and  Sz,  we  have 

*(8x8v8z)=(^  +  ^  +  Z£)  SxSySz, 
or,  equivalently, 

dSV 


dt 


(V  •  v)  6V.  (A.24) 


Tims  V  •  v  represents  the  proportional  rate  of  increase  of  volume:  positive 
V  •  v  means  expansion,  negative  V  •  v  compression.  In  particular,  if  the 
fluid  is  incompressible,  then  V  •  v  =  0. 

It  is  possible  to  show  in  a  similar  way  that  a  non-zero  VA«  means  that 
locally  the  fluid  is  rotating.  This  vector,  called  the  vorticity,  represents  the 
local  angular  velocity  of  rotation  (times  2;  see  Problem  10). 

The  rule  for  differentiating  products  can  also  be  applied  to  expressions 
involving  V.  For  example,  V  •  (A  A  B)  is  a  sum  of  two  terms,  in  one  of 
which  V  acts  on  A  only  and  in  the  other  on  B  only.  The  gradient  of  a 
product  of  scalar  fields  can  be  written 

V(<M>)  =  ipVcf)  +  </>VV>, 

and  similarly 

V  •  (</>A)  =  A  ■  V<f>  +  eW  •  A. 

But,  when  vector  products  are  involved,  we  have  to  remember  that  the 
order  of  the  factors  as  a  product  of  vectors  cannot  be  changed  without 
affecting  the  signs.  Thus  we  have 

V  ■  {A  A  B)  =  B  ■  (V  A  A)  -  A  ■  (V  A  B), 

and,  similarly, 


V  A  {4>A)  =  4>{V  A  A)  -  A  A  (V<f>). 
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We  may  apply  the  vector  differential  operator  V  twice.  The  divergence 
of  the  gradient  of  a  scalar  field  <p  is  called  the  Laplacian  of  4>, 

„2l     „  d2<f>     d2(t>     d2<j>  . ,  . 

V^  =  V.W=^f  +  ^  +  ^f.  (A.25) 

Some  operations  always  give  zero.  Just  as  a  A  a  =  0,  we  find  that  the 
curl  of  a  gradient  vanishes, 

VAV?!>  =  0.  (A.26) 

For  example,  its  z  component  is 

d_(df\_d_(df\=Q 
dx  \dy J     dy  \dx J 

Similarly,  one  can  show  that  the  divergence  of  a  curl  vanishes: 

V  •  (V  A  A)  =  0.  (A.27) 

An  important  identity,  analogous  to  the  expansion  of  the  vector  triple 
product  (A. 16),  gives  the  curl  of  a  curl, 

VA(VAA)  =  V(V-A)- V2A,  (A.28) 

where  of  course 

2  A      d2A     d2A  d2A 
V  A  =  1  1  . 

dx2      dy2  dz2 

It  may  easily  be  proved  by  inserting  the  expressions  in  terms  of  components. 


A. 6    Integral  Theorems 

There  are  three  important  theorems  for  vectors  which  are  generalizations 
of  the  fundamental  theorem  of  the  calculus, 

/(    ^'l-<-     A-';:  .A-''.::- 

First,  consider  a  curve  C  in  space,  running  from  t*o  to  r\  (see  Fig.  A. 9). 
Let  the  directed  element  of  length  along  C  be  dr.  If  (f>  is  a  scalar  field, 
then  according  to  (A.  20),  the  change  in  <f>  along  this  element  of  length  is 
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Fig.  A.9 


d(f>  =  dr  ■  V(/>.  Thus,  integrating  from  ro  to  n,  we  obtain  the  first  of  the 
integral  theorems, 


The  integral  on  the  left  is  called  the  line  integral  of  'Vcj)  along  C.  (Note 
that,  as  here,  it  is  often  more  convenient  to  place  the  differential  symbol 
dr  to  the  left  of  the  integrand.) 

This  theorem  may  be  used  to  relate  the  potential  energy  function  V(r) 
for  a  conservative  force  to  the  work  done  in  going  from  some  fixed  point 
ro,  where  V  is  chosen  to  vanish,  to  r.  Thus,  if  F  =  —'VV,  then 


When  F  is  conservative,  this  integral  depends  only  on  its  end-points,  and 
not  on  the  path  C  chosen  between  them.  Conversely,  if  this  condition  is 
satisfied,  we  can  define  V  by  (A.30),  and  the  force  must  be  conservative. 
The  condition  that  two  line  integrals  of  the  form  (A.30)  should  be  equal 
whenever  their  end-points  coincide  may  be  restated  by  saying  the  the  line 
integral  round  any  closed  path  should  vanish.  Physically,  this  means  that 
no  work  is  done  in  taking  a  particle  round  a  loop  which  returns  to  its 
starting  point.  The  integral  round  a  closed  loop  is  usually  denoted  by  the 
symbol  §c.  Thus  we  require 


for  all  closed  loops  C. 

This  condition  may  be  simplified  by  using  the  second  of  the  integral 
theorems  —  Stokes'  theorem.  Consider  a  curved  surface  S,  bounded  by  the 
closed  curve  C.  If  one  side  of  S  is  chosen  to  be  the  'positive'  side,  then  the 


(A.29) 


(A.30) 


(A.31) 
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positive  direction  round  C  may  be  defined  by  the  right-hand-screw  conven- 
tion (see  Fig.  A.  10).  Take  a  small  element  of  the  surface,  of  area  dS,  and  let 


Fig.  A.10 

n  be  a  unit  vector  normal  to  the  element,  and  directed  towards  its  positive 
side.  Then  the  directed  element  of  area  is  defined  to  be  dS  =  ndS.  Stokes' 
theorem  states  that  if  A  is  any  vector  field,  then 


dS  •  (V  A  A)  =  d>  dr-A.  (A.32) 
s  Jc 

The  application  of  this  theorem  to  (A. 31)  is  immediate.  If  the  line  integral 
round  C  is  required  to  vanish  for  all  closed  curves  C,  then  the  surface  inte- 
gral must  vanish  for  all  surfaces  S.  But  this  is  only  possible  if  the  integrand 
vanishes  identically.  So  the  condition  for  a  force  to  be  conservative  is 

V  A  F  =  0.  (A.33) 

We  shall  not  prove  Stokes'  theorem.  However,  it  is  easy  to  verify  for  a 
small  rectangular  surface.  (The  proof  proceeds  by  splitting  up  the  surface 
into  small  sub-regions.)  Suppose  S  is  a  rectangle  in  the  icy-plane,  of  area 
dxdy.  Then  dS  =  kdxdy,  so  the  surface  integral  is 

k-(VAA)dxdy=  HL±-^L±\  dxdy.  (A.34) 

The  line  integral  consists  of  four  terms,  one  from  each  edge.  The  two  terms 
arising  from  the  edges  parallel  to  the  a;-axis  involve  the  x  component  of  A 
evaluated  for  different  values  of  y.  They  therefore  contribute 

dA 

Ax(y)  dx  -  Ax(y  +  dy)  dx  =  — -^dxdy. 
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Similarly,  the  other  pair  of  edges  yields  the  first  term  of  (A. 34). 

One  can  also  find  a  necessary  and  sufficient  condition  for  a  field  B(r) 
to  have  the  form  of  a  curl, 

B  =  VAA 

By  (A.  27),  such  a  field  must  satisfy 

V  ■  B  =  0.  (A.35) 

The  proof  that  this  is  also  a  sufficient  condition  (which  we  shall  not  give 
in  detail)  follows  much  the  same  lines  as  before.  One  can  show  that  it 
is  sufficient  that  the  surface  integral  of  B  over  any  closed  surface  should 
vanish: 


dS  ■  B  =  0,       (S  closed) 

s 

and  then  use  the  third  of  the  integral  theorems,  Gauss'  theorem.  This  states 
that  if  V  is  a  volume  in  space  bounded  by  the  closed  surface  S,  then  for 
any  vector  field  B, 

JJJ  dVV-B  =  J  J  dSB,  (A.36) 

where  dV  denotes  the  volume  element  dV  =  dxdydz,  and  the  positive  side 
of  S  is  taken  to  be  the  outside. 

It  is  again  easy  to  verify  Gauss'  theorem  for  a  small  rectangular  volume, 
dV  =  dxdydz.  The  volume  integral  is 

dBx     8BV     8BZ\  ,    ,  , 

af  +  ^  +  ^Jd*d^  (A.37) 

The  surface  integral  consists  of  six  terms,  one  for  each  face.  Consider  the 
faces  parallel  to  the  :rt/-plane,  with  directed  surface  elements  kdxdy  and 
— kdxdy.  Their  contributions  involve  k  ■  B  =  Bz,  evaluated  for  different 
values  of  z.  Thus  they  contribute 

OB 

Bz  (z  +  dz)  dx  dy  —  Bz  (z)  dx  dy  =         dx  dy  dz. 


Similarly,  the  other  terms  of  (A.37)  come  from  the  other  pairs  of  faces. 
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A. 7    Electromagnetic  Potentials 

An  important  application  of  these  theorems  is  to  the  electromagnetic  field. 

The  basic  equations  of  electromagnetic  theory  are  Maxwell's  equations. 
In  the  absence  of  dielectric  or  magnetic  media,  they  may  be  expressed  in 
terms  of  two  fields,  the  electric  field  E  and  the  magnetic  field  B.  There  is 
one  pair  of  homogeneous  equations, 

r)  R 

VAE+  —  =  0,       V-B  =  0,  (A.38) 

and  a  second  pair  involving  also  the  electric  charge  density  p  and  current 
density  j, 

0E 

^VAB-eo—  =j,       e0V-E  =  p,  (A.39) 

in  which  po  and  eo  are  universal  constants. 

The  second  equation  in  (A.38)  is  just  the  condition  (A. 35).  It  follows 
that  there  must  exist  a  vector  potential  A  such  that 

B  =  V  A  A.  (A.40) 

Then,  substituting  in  the  first  of  the  equations  (A.38),  we  find  that  V  A 
(E  +  dA/dt)  =  0.  It  follows  that  there  must  exist  a  scalar  potential  <f>  such 
that 

E  =  -V<j>-^.  (A.41) 

These  potentials  are  not  unique.  If  A  is  any  scalar  field,  then  the  po- 
tentials 

0'  =  «/>+^,       A'  =  A-VA  (A.42) 

define  the  same  fields  E  and  B  as  do  <j>  and  A.  This  is  called  a  gauge 
transformation.  We  may  eliminate  this  arbitrariness  by  imposing  an  extra 
condition,  for  example  the  radiation  gauge  (or  Coulomb  gauge)  condition 

V  •  A  =  0.  (A.43) 

In  the  static  case,  where  all  the  fields  are  time-independent,  Maxwell's 
equations  separate  into  a  pair  of  electrostatic  equations,  and  a  magneto- 
static  pair.  Then  <j>  becomes  the  ordinary  electrostatic  potential,  satisfying 


398 


Classical  Mechanics 


Poisson's  equation  (6.48).  The  vector  potential,  by  (A. 39)  and  (A. 40)  sat- 
isfies 

V  A  (V  A  A)  =  noj. 

Using  (A. 28),  and  imposing  the  radiation  gauge  condition  (A. 43),  we  find 

V2A  =  -p0j.  (A.44) 

This  is  the  analogue  of  Poisson's  equation.  The  solution  is  of  the  same  form 
as  (6.15),  namely 

^  =  ?JJI^'<-  <"»> 

Tims,  given  any  static  distribution  of  charges  and  currents,  we  may  calcu- 
late the  potentials  <p  and  A,  and  hence  the  fields  E  and  B. 


A. 8     Curvilinear  Co-ordinates 

Another  use  of  the  integral  theorems  is  to  provide  expressions  for  the  gra- 
dient, divergence  and  curl  in  terms  of  curvilinear  co-ordinates. 

Consider  a  set  of  orthogonal  curvilinear  co-ordinates  (see  §3.5)  q±,  q%,  q-j- 
Let  us  denote  the  elements  of  length  along  the  three  co-ordinate  curves  by 
hi  dqi,  hi  d(j2,  h$  dq-A.  For  example,  in  cylindrical  polars 

hp  =  1,        hv  =  p,        hz  =  1,  (A.46) 

while  in  spherical  polars 

hr  =  1,        hg  =  r,        hv  =  rs'm9.  (A. 47) 

Now  consider  a  scalar  field  tp,  and  two  neighbouring  points  (91,52,93)  and 
(qi  +d<ji,  92,  93).  Then  the  difference  between  the  values  of  tp  at  these  points 
is 

dffi  =  dtp  =  dr  ■  V'</>  =  hi  dqi  (VV>)i, 

dqi 

where  (Vi/>)i  is  the  component  of  'Vtp  in  the  direction  of  increasing  q\. 
Hence  we  find 
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with  similar  expressions  for  the  other  components.  Thus  in  cylindrical  and 
spherical  polars,  we  have 

V*-f*i£,£Y  (A. 49) 


dp'  p  dip'  dz 


and 


7T' ^~^7T    •  A-50 

\or  r  do  r  sin  &  dip  J 

To  find  an  expression  for  the  divergence,  we  use  Gauss'  theorem,  applied 
to  a  small  volume  bounded  by  the  co-ordinate  surfaces.  The  volume  integral 
is 

(V  •  A)  hi  Aqi  h2  dq2  h3  dq3. 

In  the  surface  integral,  the  terms  arising  from  the  faces  which  are  surfaces 
of  constant  q3  are  of  the  form  A3h\  dqi  h2  dq2,  evaluated  for  two  different 
values  of  q3 .  They  therefore  contribute 

d 

-^—{hih2Az)  dqi  dq2  dq3. 
dq3 

Adding  the  terms  from  all  three  pairs  of  faces,  and  comparing  with  the 
volume  integral,  we  obtain 

hih2h3  \      dqx        '        dq2  dq3      )  ' 

In  particular,  in  cylindrical  and  spherical  polars, 

V.A=i^  +  i^  +  ^,  (A.52) 
pop        p  dip  dz 

and 

v  A_l  d(r^Ar)  |      1     d(Sm9Ae)  |      1  dA^ 
r2     dr         rsin9       dO  rsm6  dp  ' 

To  find  the  curl,  we  use  Stokes'  theorem  in  a  similar  way.  Let  us  consider 
a  small  element  of  a  surface  q3  =  constant,  bounded  by  curves  of  constant 
qi  and  of  q2.  Then  the  surface  integral  is 

(V  A  A)3  h\  dqi  h2  dq2. 
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In  the  line  integral  around  the  boundary,  the  two  edges  of  constant  q2 
involve  A\h\  dq\  evaluated  for  different  values  of  q2,  and  so  contribute 

d 

-—- (hiAi)dqi  dq2. 
dq2 

Hence,  adding  the  contribution  from  the  other  pair  of  edges,  we  obtain 

1    (d(h2A2)  0(hii4i) 


(V  A  A)3 


h\h2 


dqi 


dq2 


(A.54) 


with  similar  expressions  for  the  other  components.  Thus,  in  particular,  in 
cylindrical  and  spherical  polars 


V  A  A 

and 

VA  A- 


ldAz  _  dAp_  dAp  _  dA^  1 

p  dtp       dz  '  dz       dp  '  p 

1     [^(sing  Ay)  _  dAe 
rsin#  |_       d6  dip  J 

f     dAr     1  d{rAv)  1 
rsin6*  9<p      r     dr     '  r 


dAp 
dp  dip 


(A.55) 


dQAe)  _  dAr 
dr  df 


(A.56) 


Finally,  combining  the  expressions  for  the  divergence  and  gradient,  we 
can  find  the  Laplacian  of  a  scalar  field.  It  is 


V2'(/!> 


hih2h3 


d 


(  h2h3  dtp 


+  ■ 


h3h\  dtp 


dqi  V  hi   dqi  J      dq2  \  h2  dq2 

d    (h\h2  dip 


dqz  V  h3  dq3 


(A.57) 


In  cylindrical  polars, 


and,  in  spherical  polars, 


2  ,      1  d  (  dip\      1  d2tp  d2ip 


p  dp  \  dp  )     p2  dp 


dz2 


l  d 


r2  dr 


,di> 


dr 


I   d_ 

r2  sin  6  d9 


sin  6^  I 


d2i> 


d9  J      r2  sin2  d  dp2  ' 


(A.58) 


(A.59) 
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A. 9  Tensors 

Scalars  and  vectors  are  the  first  two  members  of  a  family  of  objects  known 
collectively  as  tensors,  and  described  by  1,  3,  9,  27, . . .  components.  Scalars 
and  vectors  are  called  tensors  of  valence  0  and  valence  1,  respectively. 
(Sometimes  the  word  rank  is  used  instead  of  'valence',  but  there  is  then 
a  possibility  of  confusion  with  a  different  usage  of  the  same  word  in  matrix 
theory.) 

In  this  section,  we  shall  be  concerned  with  the  next  member  of  the 
family,  the  tensors  of  valence  2,  often  called  dyadics.  We  shall  use  the  word 
'tensor'  in  this  restricted  sense,  to  mean  a  tensor  of  valence  2. 

Tensors  occur  most  frequently  when  one  vector  b  is  given  as  a  linear 
function  of  another  vector  a,  according  to  the  matrix  equation 


~bx 

~T    T  T 

-1- XX  J  xy  ±xz 

ax 

by 

TyX    Tyy  TyZ 

ay 

(A.60) 

TZx  TZy  Tzz_ 

_az_ 

An  example  is  the  relation  (9.17)  between  the  angular  momentum  J  and 
angular  velocity  a;  of  a  rigid  body. 

The  nine  elements  of  the  3x3  matrix  in  (A.60)  are  the  components  of 
a  tensor,  which  we  shall  denote  by  the  sans-serif  capital  T.  By  an  obvious 
extension  of  the  dot  product  notation  for  the  scalar  product  of  two  vectors, 
we  may  write  (A.60)  as 

b  =  T  a.  (AM) 

For  example,  (9.17)  may  be  written  J  =  I  •  w,  where  I  is  the  inertia  tensor. 

We  can  go  on  to  form  the  scalar  product  of  (A. 61)  with  another  vector, 
c,  obtaining  a  scalar,  c  •  T  •  a.  Note  that  in  general  this  is  not  the  same  as 
a  T  ■  c.  In  fact, 

a  T  c  =  c  f  ■  a,  (A.62) 

where  T  is  the  transposed  tensor  of  T,  obtained  by  reflecting  in  the  leading 
diagonal,  e.g.,  fxy  =  Tyx. 

The  tensor  T  is  called  symmetric  if  T  =  T,  i.e.,  if  Tji  =  Tjj  for  all  i,j. 
It  is  antisymmetric  if  T  =  — T,  or  Tji  =  —Tij  for  all 

An  interesting  example  of  an  antisymmetric  tensor  is  provided  by  the 
relation  (5.2)  giving  the  velocity  v  as  a  function  of  position  r  in  a  body 
rotating  with  angular  velocity  u>.   It  is  a  linear  relation  and  so  may  be 
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written  in  the  form  (A. 60),  specifically  as 


v. 


0         —  UJZ  UJy 

Ulz        0  —Ul: 

-UJy       LOx  0 


X 


V 


z 


There  is  an  important  special  tensor, 


1  = 


1  0  0' 
0  f  0 
0  0  f 


called  the  unit  tensor  or  identity  tensor,  with  the  property  that  1  •  a  =  a 
for  all  vectors  a. 

From  any  two  vectors  a  and  b,  we  can  form  a  tensor  T  by  multiplying 
their  elements  together  (without  adding),  i.e.,  Tij  =  cubj.  This  is  the  tensor 
product  (or  dyadic  product  or  outer  product)  of  a  and  b,  written  T  =  ab, 
with  no  dot  or  cross.  Note  that  T-c  =  (ab)  c  =  a(b-c),  so  the  brackets  are 
in  fact  unnecessary.  In  matrix  notation,  ab  is  the  product  of  the  column 
vector  a  and  the  row  vector  b,  while  the  scalar  product  (or  inner  product) 
a  ■  b  is  the  row  a  times  the  column  b. 

We  can  deduce  the  correct  transformation  law  of  a  tensor  under  a  rota- 
tion of  axes:  its  components  transform  just  like  the  products  of  components 
of  two  vectors.  If  we  symbolize  (A. 2)  formally  as  a'  =  R  a,  then  the  correct 
transformation  law  of  a  tensor  is  T'  =  R  •  T  •  R.  (This  denotes  a  product  of 
three  3x3  matrices.) 

The  use  of  the  tensor  product  allows  us  to  write  some  old  results  in  a 
new  way.  For  example,  for  any  vector  a, 

1  •  a  =  a  =  i(i  ■  a)  +  j(j  ■  a)  +  k(k  ■  a)  =  (ii  +  jj  +  kk)  ■  a, 


as  may  easily  be  verified  by  writing  out  the  components. 

Similarly,  we  may  write  the  relation  (9.16)  between  angular  momentum 
and  angular  velocity  in  the  form 


whence 


H  +  jj  +  kk 


1, 


(A.63) 


where  the  inertia  tensor  I  is  given  explicitly  by 
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Note  the  difference  between  the  unit  tensor  1  and  the  inertia  tensor  I.  It 
is  easy  to  check  that  the  nine  components  of  this  equation  reproduce  the 
relations  (9.15). 

Note  that  if  T  =  ab,  then  T  =  ba,  whence  in  particular  the  inertia 
tensor  I  is  symmetric. 

A.  10    Eigenvalues;  Diagonalization  of  a  Symmetric  Tensor 

In  this  section,  we  discuss  a  theorem  that  has  very  wide  applicability. 

Let  T  be  a  symmetric  tensor.  A  vector  a  is  called  an  eigenvector  of  T, 
with  eigenvalue  A,  if 

T  •  a  =  Xa,  (A.64) 

or,  equivalently  (T  —  Al)  •  a  =  0.  (Compare  (11.17),  which  is  also  an  eigen- 
value equation.)  The  condition  for  the  existence  of  a  non-trivial  solution  is 
that  the  determinant  of  the  coefficients  vanishes, 


T  — 

±  XX 

A 

T 

±  xz 

det(T  -  Al)  = 

T 

J-yx 

T  - 

±vv 

A  Tyz 

=  0. 

T 

'  zx 

T 

TZz  —  A 

This  is  a  cubic  equation  for  A.  Its  three  roots  are  either  all  real,  or  else  one 
real  and  one  complex  conjugate  pair.  However,  for  a  symmetric  tensor  T 
with  real  elements  the  latter  possibility  can  be  ruled  out. 

To  see  this,  suppose  that  A  is  a  complex  eigenvalue,  and  let  a  be  the 
corresponding  eigenvector,  whose  components  may  also  be  complex.  Now, 
taking  the  complex  conjugate  of  Ta  =  Xa,  we  obtain  Ta*  =  A* a*,  where 
A*  denotes  the  complex  conjugate  of  A,  and  a*  =  (a*,a*,a*z).  Multiplying 
these  two  equations  by  a*  and  a  respectively,  we  obtain 

a*  ■  T  •  a  =  Xa*  ■  a,        and       a  ■  T  •  a*  =  X*a  ■  a* . 

But  since  T  is  symmetric,  the  left-hand  sides  of  these  equations  are  equal, 
by  (A. 62).  Hence  the  right-hand  sides  must  be  equal  too.  Since  a*  ■  a  = 
\a-x\2  +  \ay\2  +  \az\2  =  a  ■  a*,  this  means  that  A*  =  A,  i.e.,  X  must  be  real. 

Thus  we  have  shown  that  there  are  three  real  eigenvalues,  say  Ai,  A2,  A3, 
and  three  corresponding  real  eigenvectors,  a±,  02, 03.  (We  consider  the  case 
where  two  eigenvalues  are  equal  below.)  Next,  we  show  that  the  eigenvec- 
tors are  orthogonal.  For,  if 


T  •  ai  =  X\a\,       T  •  02  =  A2<i2, 
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then,  multiplying  the  first  equation  by  02  and  the  second  by  01,  and  again 
using  the  symmetry  of  T,  we  obtain 


Thus  if  Ai  7^  A2,  then  ai  •      =  0. 

If  all  three  eigenvalues  are  distinct,  then  the  three  eigenvectors  are  or- 
thogonal. Moreover,  it  is  clear  that  if  a  is  an  eigenvector,  then  so  is  any  mul- 
tiple of  a,  so  that  we  may  choose  to  normalize  it,  defining  ei  =  a\ja\.  Then 
the  three  normalized  eigenvectors  form  an  orthonormal  triad,  ei,e2,e3.  If 
we  choose  these  as  axes,  then  T  must  take  the  diagonal  form 


For,  since  e\  =  (1,  0,  0),  T  •  e\  is  simply  the  first  column  of  T,  and  this  must 
be  Aiei  =  (Ai,  0,  0).  Similarly  for  the  other  columns. 

This  relationship  between  T  and  the  eigenvectors  may  also  be  ex- 
pressed, using  the  tensor-product  notation,  in  a  co-ordinate-independent 
form,  namely 


Finally,  we  have  to  show  that  these  results  still  hold  if  two  or  three  eigen- 
values coincide.  The  simplest  way  to  do  this  is  to  add  a  small  quantity  e  to 
one  of  the  diagonal  components  of  T,  to  make  the  eigenvalues  slightly  differ- 
ent. So  long  as  e  7^  0,  the  tensor  must  have  three  orthonormal  eigenvectors. 
By  continuity,  this  must  still  be  true  in  the  limit  e  — >  0.  (The  symmetry 
of  T  is  important  here,  because  without  the  consequent  orthogonality  of 
eigenvectors  we  could  not  exclude  the  possibility  that  two  eigenvectors  that 
are  distinct  for  e  7^  0  have  the  same  limit  as  e  — >  0.  Indeed,  this  does 
happen  for  non-symmetric  tensors,  as  will  be  seen  in  a  different  context  in 
Appendix  C.) 

We  have  shown,  therefore,  that  any  symmetric  tensor  may  be  diagonal- 
ized  by  a  suitable  choice  of  axes.  This  was  the  result  we  used  for  the  inertia 
tensor  in  Chapter  9.  In  that  case,  the  eigenvectors  are  the  principal  axes, 
and  the  eigenvalues  the  principal  moments  of  inertia.  The  procedure  for 
finding  normal  co-ordinates  for  an  oscillating  system,  discussed  in  §11.2, 
is  essentially  the  same.  In  that  case,  it  is  the  potential  energy  function 
that  is  brought  to  'diagonal'  form.  Eigenvalue  equations  also  appear  in  the 


Al02  •  di  =  \2CL\  ■  a2. 


T  = 


Ai  0  0" 
0  A2  0 
.0   0  A3. 


(A.65) 


T  =  Aieiei  +  A2e2e2  +  A3e3e3. 


(A.66) 
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analysis  of  dynamical  systems  in  Chapter  13  and  in  many  other  branches 
of  physics,  in  particular  playing  a  big  role  in  quantum  mechanics. 


Problems 

1.  Given  a  =  (3, -1,2),  6=  (0,1,1)  and  c=  (2,2,-1),  find: 

(a)  a  ■  b,  a  ■  c  and  a  ■  (b  +  c); 

(b)  a  A  b,  a  A  c  and  a  A  (b  +  c); 

(c)  (a  A  b)  ■  c  and  (a  A  c)  •  f>; 

(d)  (a  A  6)  A  c  and  (a  A  c)  A  b; 

(e)  (a  •  c)b  —  (b  ■  c)a  and  (a  ■  b)c  —  (b  ■  c)a. 

2.  Find  the  angles  between  the  vectors  a  A  b  and  c,  and  between  a  A  c 
and  fe,  where  a,  f>,  c  are  as  in  Problem  1. 

3.  Show  that  c  =  (ab  +  ba)/(a  +  b)  bisects  the  angle  between  a  and  b, 
where  a  and  b  are  any  two  vectors. 

4.  Find  V0  if  <^>  =  x3  —  xyz.  Verify  that  V  A  V0  =  0,  and  evaluate  V2<^>. 

5.  (a)  Find  the  gradients  of  u  =  x  +  y2/x  and  v  =  y  +  x2/y,  and  show 

that  they  are  always  orthogonal, 
(b)  Describe  the  contour  curves  of  u  and  v  in  the  xy-pl&ne.  What  does 
(a)  tell  you  about  these  curves? 

6.  Draw  appropriate  figures  to  give  geometric  proofs  for  the  following  laws 
of  vector  algebra: 


(Note  that  a,  b,  c  need  not  be  coplanar.) 
7.  Show  that  (a  A  b)  ■  (c  A  d)  =  a  ■  cb  ■  d  —  a  ■  db  ■  c.  Hence  show  that 


8.  Express  V  A  (a  A  b)  in  terms  of  scalar  products. 

9.  If  the  vector  field  v(r)  is  defined  by  v  =  ujk  A  r,  verify  that  V  •  v  =  0, 
and  evaluate  the  vorticity  V  A  v. 

10.  *Show  that,  if  u  and  v  are  scalar  fields,  the  maxima  and  minima  of  u 
on  the  surface  v  =  0  are  points  where  Vw  =  XVv  for  some  value  of  A. 
Interpret  this  equation  geometrically.  (Hint:  On  v  =  0  only  two  co- 
ordinates can  vary  independently.  Thus  Sz  for  example  can  usually  be 
expressed  in  terms  of  Sx  and  5y.  We  require  that  6u  should  vanish  for 


(a  +  b)+c 
A(a  +  b) 
a  -  (b  +  c) 


a  +  (b  +  c); 
\a  +  Ab; 
a  ■  b  +  a  ■  c. 


(aAb)2 


a2b2  -{a-bf. 


406 


Classical  Mechanics 


all  infinitesimal  variations  satisfying  this  constraint.)  Show  that  this 
problem  is  equivalent  to  finding  the  unrestricted  maxima  and  minima 
of  the  function  w(r,  A)  =  u  —  Xv  as  a  function  of  the  four  independent 
variables  x,  y,  z  and  A.  Here  A  is  called  a  Lagrange  multiplier.  What  is 
the  role  of  the  equation  dw/dX  =  0? 

11.  ^Evaluate  the  components  of  V  A  in  cylindrical  polar  co-ordinates  by 
using  the  identity  (A. 28).  Show  that  they  are  not  the  same  as  the 
scalar  Laplacians  of  the  components  of  A. 

12.  *Find  the  radiation-gauge  vector  potential  at  large  distances  from  a 
circular  loop  of  radius  a  carrying  an  electric  current  /.  [Hint:  Consider 
first  a  point  (x,  0,  z),  and  expand  the  integrand  in  powers  oia/r,  keeping 
only  the  linear  term.  Then  express  your  answer  in  spherical  polars.] 
Hence  find  the  magnetic  field  —  the  field  of  a  magnetic  dipole.  Express 
the  results  in  terms  of  the  magnetic  moment  /x,  a  vector  normal  to  the 
loop,  of  magnitude  fi  =  -ko?I. 

13.  ^Calculate  the  vector  potential  due  to  a  short  segment  of  wire  of  di- 
rected length  ds,  carrying  a  current  /,  placed  at  the  origin.  Evaluate 
the  corresponding  magnetic  field.  Find  the  force  on  another  segment, 
of  length  ds',  carrying  current  I',  at  r.  (To  compute  the  force,  treat  the 
current  element  as  a  collection  of  moving  charges.)  Show  that  this  force 
does  not  satisfy  Newton's  third  law.  (To  preserve  the  law  of  conserva- 
tion of  momentum,  one  must  assume  that,  while  this  force  is  acting, 
some  momentum  is  transferred  to  the  electromagnetic  field.) 

14.  *Given  u  =  cosd  and  v  =  lnr,  evaluate  A  =  uV«  —  wVti.  Find  the 
divergence  and  curl  of  A,  and  verify  that  V  •  A  =  tiV2»  —  «V2«  and 
that  V  A  A  =  2Vu  A  Vu. 

15.  *Show  that  the  rotation  which  takes  the  axes  i,j,k  into  i',j',k'  may 
be  specified  by  r  — >  r'  =  Rr,  where  the  tensor  R  is  R  =  i'i+j'j  +  k'k. 
Write  down  the  matrix  of  components  of  R  if  the  rotation  is  through 
an  angle  6  about  the  y-axis.  What  is  the  tensor  corresponding  to  the 
rotation  which  takes  i',j',k'  back  into  i,j,kl  Show  that  R  •  R  =  1. 
(Such  tensors  are  said  to  be  orthogonal.) 

16.  *The  trace  of  a  tensor  T  is  the  sum  of  its  diagonal  elements,  tr(T)  = 
J2i  Tu-  Show  that  the  trace  is  equal  to  the  sum  of  the  eigenvalues,  and 
that  the  determinant  det(T)  is  equal  to  the  product  of  the  eigenvalues. 

17.  *The  double  dot  product  of  two  tensors  is  defined  as  S:T  =  tr(S  •  T)  = 
Y^i  J2j  SijTji-  Evaluate  1  :  1  and  1  :  rr.  Show  that 

(3rV  -  r'2l)  :  (rr  -  |r2l)  =  3(r'  •  r)2  -  r'V. 
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Hence  show  that  the  expansion  (6.19)  of  the  potential  may  be  written 

and  write  down  an  expression  for  the  quadrupole  tensor  Q.  Show  that 
tr(Q)  =  0,  and  that  in  the  axially  symmetric  case  it  has  diagonal  el- 
ements —  \Q,—  \Q,Q,  where  Q  is  the  quadrupole  moment  defined  in 
Chapter  6.  Show  also  that  the  gravitational  quadrupole  tensor  is  re- 
lated to  the  inertia  tensor  I  by  Q  =  tr(l)l  -  31. 
18.  *In  an  elastic  solid  in  equilibrium,  the  force  across  a  small  area  may 
have  both  a  normal  component  (of  compression  or  tension)  and  trans- 
verse components  (shearing  stress).  Denote  the  ith  component  of  force 
per  unit  area  across  an  area  with  normal  in  the  jth  direction  by  T^. 
These  are  the  components  of  the  stress  tensor  T.  By  considering  the 
equilibrium  of  a  small  volume,  show  that  the  force  across  area  A  with 
normal  in  the  direction  of  the  unit  vector  n  is  F  =  T  ■  nA.  Show  also 
by  considering  the  equilibrium  of  a  small  rectangular  volume  that  T  is 
symmetric.  What  physical  significance  attaches  to  its  eigenvectors? 
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Appendix  B 

Conies 


Conic  sections,  or  conies  for  short,  are  most  simply  defined  as  curves  in 
a  plane  whose  equation  in  Cartesian  co-ordinates  is  quadratic  in  x  and  y. 
The  name  derives  from  the  fact  that  they  can  be  obtained  by  making  a 
plane  section  through  a  circular  cone.  They  turn  up  in  several  physical 
applications,  particularly  in  the  theory  of  orbits  under  an  inverse  square 
law  force.  It  may  be  useful  to  gather  together  the  relevant  mathematical 
information. 

B.l     Cartesian  Form 

The  most  general  conic  would  have  an  equation  of  the  form 

Ax2  +  2Bxy  +  Cy2  +  2Dx  +  2Ey  +  F  =  0, 

where  A,  B, . . . ,  F  are  real  constants,  but  by  choosing  the  axes  appropri- 
ately we  can  reduce  this  to  a  simpler  form. 

First,  we  look  at  the  quadratic  part,  Ax2  +  2Bxy  +  Cy2.  It  is  always 
possible  by  rotating  the  axes  to  eliminate  the  constant  B.  This  is  another 
example  of  the  diagonalization  process  described  in  §11.3  and  §A.10.  The 
quadratic  part  of  the  equation  is  then  reduced  to  a  sum  of  squares,  A'x'2  + 
Cy'2 .  We  then  forget  about  the  original  co-ordinates,  and  drop  the  primes. 
The  nature  of  the  curve  is  largely  determined  by  the  ratio  A/C  of  the  new 
constants. 

Let  us  assume  for  the  moment  that  A  and  C  are  both  non-zero  (we  will 
come  back  later  to  the  special  case  where  that  isn't  true).  Then  we  can 
choose  to  shift  the  origin  (adding  constants  to  x  and  y,  e.g.,  x'  =  x  +  D/A) 
so  as  to  remove  D  and  E.  If  F  is  also  non-zero,  we  can  move  it  to  the  other 
side  of  the  equation,  and  divide  by  —F,  to  get  the  standard  form  of  the 
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equation, 


Ax2  +  Cy2  =  1. 


(B.l) 


F  =  0  is  a  degenerate  case:  if  A  and  C  have  the  same  sign,  the  only  solution 
is  x  =  y  =  0;  if  they  are  of  opposite  sign,  the  equation  factorizes,  and  so 
represents  a  pair  of  straight  lines,  y  =  ±yj —A/C  x. 

We  cannot  allow  both  A  and  C  in  (B.l)  to  be  negative;  the  equation 
would  then  have  no  solutions  at  all.  So  we  can  distinguish  two  cases: 

1.  Both  A  and  C  are  positive.  Without  loss  of  generality  we  can  assume 
that  A  <  C.  (If  A  >  C,  we  simply  interchange  the  x  and  y  axes.) 
Defining  new  positive  constants  a  and  b  by  A  =  1/a2  and  C  =  1/b2, 
we  finally  arrive  at  the  canonical  form  of  the  equation, 


(B.2) 


a-  b2 

This  is  the  equation  of  an  ellipse  (see  Fig.  B.l).  Here  a  >  b;  a  is 
the  semi-major  axis  and  b  is  the  semi-minor  axis.  (In  the  special  case 


x 

Fig.  B.l 


a  =  6,  we  have  a  circte  of  radius  a.) 

A  and  C  have  opposite  signs.  Again,  without  loss  of  generality,  we  can 
assume  that  A  >  0  and  C  <  0.  So,  defining  ,4  =  1/a2  and  C  =  — 1/62, 
we  get 

=  1,  (B.3) 


2T 
62 


the  equation  of  a  hyperbola  (see  Fig.  B.2);  a  and  b  are  still  called  the 
semi-major  axis  and  the  semi-minor  axis  respectively,  although  it  is  no 
longer  necessarily  true  that  a  is  the  larger.  Note  that  this  curve  has  two 
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Fig.  B.2 

separate  branches  on  opposite  sides  of  the  origin.  At  large  distances, 
it  asymptotically  approaches  the  two  straight  lines  y  =  ±(b/a)x  shown 
on  the  figure. 

We  still  have  to  consider  the  special  case  where  one  of  the  constants  A 
and  C  vanishes.  (They  cannot  both  vanish,  otherwise  we  have  simply  a 
linear  equation,  representing  a  straight  line.)  Without  loss  of  generality,  we 
may  assume  that  A  =  0  and  C  /  0.  As  before,  we  can  shift  the  origin  in  the 
y  direction  to  eliminate  E.  On  the  other  hand,  D  cannot  be  zero  (otherwise 
x  doesn't  appear  at  all  in  the  equation).  This  time,  we  can  choose  the  origin 


I 

111  X 

Fig.  B.3 

in  the  x  direction  to  make  F  =  0  (by  setting  x'  = 
defining  /  =  —D/C,  we  arrive  at  the  canonical  form 

y2  =  2lx, 

which  is  the  equation  of  a  parabola  (see  Fig.  B.3). 


x  +  F/2D).  Finally, 
(B.4) 
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Areas  are  easy  to  compute  using  the  Cartesian  form  of  the  equation. 
For  example,  we  can  solve  (B.2)  for  y  and  integrate  to  find  the  area  of  the 
ellipse;  the  result,  which  generalizes  the  familiar  irr2  for  a  circle,  is  nab.  (In 
fact,  the  ellipse  may  be  thought  of  as  a  circle  of  radius  a  which  has  been 
squashed  uniformly  in  the  y  direction  in  the  ratio  b/a.) 


B.2    Polar  Form 

When  we  are  looking  for  orbits  under  the  action  of  a  central  force,  it  is 
usually  convenient  to  use  polar  co-ordinates.  The  form  of  the  equation 
that  emerged  from  the  discussion  in  §4.4  was 

r(ecos6>±  1)  =  I,  (B.5) 

where  e  and  I  are  constants  satisfying  e  >  0,  I  >  0  (the  upper  and  lower 
signs  refer  to  the  attractive  and  repulsive  cases,  respectively). 

It  is  interesting  to  see  how  this  form  is  related  to  the  Cartesian  form 
above.  If  we  rearrange  (B.5)  and  square  it,  we  obtain  for  both  signs  the 
equation 

x2  +  y2  =  (I  -  ex)2.  (B.6) 

This  can  easily  be  put  into  one  of  the  canonical  forms  above;  which  one 
depends  on  the  value  of  e. 

I.  If  e  <  1,  we  can  'complete  the  square'  in  (B.6)  and  write  it  as 

e2;2  ;2 

(1  -  e2)x2  +  2elx  +  .  +  y2  =  (B.7) 

1  —  eA  I  —  ez 

Dividing  by  I2 / (1  —  e2),  this  reduces  almost  to  the  form  (B.2),  where 

The  only  difference  is  that  the  origin  is  not  at  the  centre  of  the  ellipse: 
(B.7)  is  equivalent  to 

^#  +  g  =  l,  (B.9) 
az  bz 

an  ellipse  with  centre  at  (— ae,0). 
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II.  If  e  >  1,  we  complete  the  square  in  the  same  way  and  divide  by 
l2/(e2  -  1),  obtaining 

(X~aef     £  =  1,  (B.10) 


a2  b2 


where  now 


1 


(B.ll) 


This  is  a  hyperbola  with  centre  at  (ae,0).  The  left-hand  branch,  intersect- 
ing the  x-axis  at  (ae  —  a,0),  corresponds  to  an  orbit  under  an  attractive 
inverse  square  law  force,  while  the  right-hand  one,  meeting  it  at  (ae  +  a,  0), 
corresponds  to  the  repulsive  case. 

III.  Finally,  if  e  =  1,  the  equation  can  be  written 

y2  =  l2  -  2lx,  (B.12) 

which  is  a  parabola  with  its  apex  at  (7/2,0),  and  oriented  in  the  opposite 
direction  to  (B.4). 

In  all  these  cases,  the  position  of  the  origin  is  one  focus  of  the  conic. 
In  cases  I  and  II  there  is  a  second  focus  symmetrically  placed  on  the  other 
side  of  the  centre;  for  the  parabola,  the  second  focus  is  at  infinity.  (The 
plural  of  focus  is  foci.)  The  reason  for  the  name  is  an  intriguing  geometric 
property  (see  Problem  2):  if  we  have  a  perfect  mirror  in  the  shape  of  an 
ellipse  light  from  a  source  at  one  focus  will  converge  to  the  second  focus. 
Similarly,  a  source  at  the  focus  of  a  parabolic  mirror  generates  a  parallel 
beam,  which  makes  parabolic  mirrors  ideal  for  certain  applications.  For  a 
hyperbolic  mirror  with  a  source  at  one  focus,  the  reflected  light  will  appear 
to  come  from  a  virtual  image  at  the  second  focus. 


Problems 

1.  The  equation  (B.2)  of  an  ellipse  can  be  written  in  parametric  form  as 
x  =  a  cost/),  y  =  bs'mip.  Show  [using  the  identity  b2  =  (1  —  e2)a2)]  that 
the  distances  between  the  point  labelled  ip  and  the  two  foci,  (±ae,  0),  are 
a(l^f  e  cos  ip),  and  hence  that  the  sum  of  the  two  distances  is  a  constant. 
(This  result  provides  a  commonly  used  method  of  drawing  an  ellipse,  by 
tying  a  string  between  two  pegs  at  the  foci,  stretching  it  round  a  pencil, 
and  drawing  a  curve  while  keeping  the  string  taut.) 
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2.  *Using  the  parametrization  of  the  previous  question,  show  that  the  slope 
of  the  curve  is  given  by  dy/dx  =  —  (b/a)  cotip.  Hence  show  that  the 
angles  between  the  curve  and  the  two  lines  joining  it  to  the  foci  are 
equal.  (One  way  is  to  find  the  scalar  products  between  the  unit  vector 
tangent  to  the  curve  and  the  unit  vectors  from  the  two  foci.  This  result 
provides  a  proof  of  the  focussing  property:  light  from  one  focus  converges 
to  the  other.) 

[The  results  stated  in  Problems  1,  2  imply  that  all  radiation  originating 
at  one  focus  of  an  ellipse  at  a  particular  time  is  then  reflected  to  the 
other  focus  with  the  same  time  of  arrival  —  a  consequence  with  many 
applications,  both  peaceful  and  otherwise.] 
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Phase  Plane  Analysis  near 
Critical  Points 


In  this  appendix  we  give  a  summary  of  the  types  of  behaviour  exhibited  by 
a  general  autonomous  dynamical  system  near  critical  points  in  the  phase 
plane  (n  =  2),  as  indicated  in  §13.3. 

C.l    Linear  Systems  and  their  Classification 

We  saw  in  §13.3  that,  in  the  local  expansion  near  a  critical  point  (xo,yo), 
the  key  to  the  local  behaviour  and  to  the  stability  of  the  equilibrium  at  the 
critical  point  is,  normally,  the  behaviour  of  the  linear  system 


T 

d 

=  M 

X 

A 

dt 

_'/_ 

Jl. 

(C.l) 


which  is  obtained  from  (13.14)  by  neglecting  higher-order  terms  in  the  ex- 
pansion. The  2x2  Jacobian  matrix  M  [a  tensor  of  valence  2  (§A.10)] 
has  constant  entries,  which  are  found  as  derivatives  of  the  functions 
F(x,y),G(x,y)  evaluated  at  the  critical  points  (xo,yo),  as  in  (13.11), 
(13.13). 
Consider 


where  a,  b,  c,  d  are  real  constants.  For  the  critical  point  itself,  at  which  £  = 
0,  r]  =  0,  to  be  an  isolated  critical  point  it  is  necessary  that  the  determinant 
of  M  is  non-zero.  That  is  to  say  ad  —  bc^Q  and  M  then  has  an  inverse.  If 
this  condition  is  not  satisfied,  so  that  M  is  singular,  then  there  is  at  least  a 
critical  line  through  £  =  0,  rj  =  0,  rather  than  just  the  single  point;  we  do 
not  consider  this  case  further  here. 
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If  we  seek  a  solution  to  (C.l)  in  the  form 

eAt  =  £0eAt,  (C.2) 
then  we  require 

M£0  =  A£0  (C3) 

and  this  is  an  eigenvalue/eigenvector  problem.  (See  §A.10,  although  M 
may  not  now  be  symmetric.) 

Here  the  eigenvalues  Ai,  A2  satisfy  the  quadratic  equation 

A2  -  (a  +  d)X  +  (ad  -  be)  =  0, 

that  is 

A2  -  (trM)A  +  (det  M)  =  (A  —  Ai)(A  -  A2)  =  0,  (C.4) 

so  that  Ai  +  A2  =  trM,  the  trace  of  M  and  A1A2  =  detM,  the  determi- 
nant of  M  (see  Appendix  A,  Problem  16).  The  eigenvalues  Ai,A2  lead 
to  corresponding  eigenvectors  £oij£o2  respectively,  in  principle,  but,  since 
the  matrix  M  is  not  necessarily  symmetric  we  have  eigenvalues  which  may 
not  be  real  and  a  set  of  eigenvectors  which  may  not  be  orthogonal  or  even 
complete. 

There  are  various  cases  depending  on  the  nature  of  the  eigenvalues  and 
we  can  consider  separately  the  cases  Ai  7^  A2  and  Ai  =  A2. 

1.  Ai  /  A2.  In  this  case,  because  of  the  linearity  of  the  system,  we  can 
write 

£(i)  =  ci£oieAlt  +  c2£o2eA2t,  (C.5) 

with  Ci,C2  constants.  The  vectors  £0ii£o2  are  independent  in  this  case 
and  any  vector  can  be  expressed  as  a  linear  combination  of  them.  In 
particular  £0  =  £(0)  leads  to  the  unique  values  of  c\,C2  corresponding 
to  given  initial  conditions.  In  this  situation  we  can  carry  out  a  linear 
change  of  variables 


Co 

7). 

Jlo. 
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similar  to  the  change  to  normal  co-ordinates  in  §11.4,  in  such  a  way  that 


T 

Ai  0" 

T 

0  A2_ 

(C.6) 


In  this  similarity  transformation  the  2x2  matrix  S  =  [£01 :  £02]  and  the 
diagonal  matrix  in  (C.6)  above  then  takes  the  form  S~1MS.  It  should 
be  noted  here  that  \i,  £oi,  Ci  (i  =  1,2)  could  be  complex,  but  even  then 
(C.5)  is  the  formal  expression  of  the  solution  for  £.  In  the  case  when  M 
is  symmetric  then  the  eigenvalues  Ai,A2  are  real  and  the  eigenvectors 
£oii  £02  are  orthogonal.  If  the  eigenvectors  are  normalized  to  have  unit 
length  then  S~x  =  S,  i.e.  S  is  a  rotation  matrix. 
2.  Ai  =  A2  (=  A).  In  this  case,  A  is  necessarily  real  and  we  may  find  that 
the  matrix  reduction  to  diagonal  form  indicated  above  may,  or  may  not, 
be  possible: 

(a)  If  we  can  find  two  distinct  eigenvectors  corresponding  to  A  then  the 
above  machinery  will  go  through  trivially,  since  the  matrix  M  =  XI 
in  this  case,  where  /  is  the  unit  matrix,  and  all  non-zero  vectors  are 
eigenvectors! 

(b)  If  there  are  not  two  distinct  eigenvectors  corresponding  to  A  then 
the  best  that  can  be  done  by  a  linear  transformation  is  to  reduce 
the  system  to 


d 

T 

'A  0' 

"f 

dt 

1  A 

JL 

(C.7) 


since  the  diagonal  form  is  not  now  achievable.  The  system  (C.7)  has 
the  solution 

(C.8) 

with  C\,C2  constants. 

Depending  on  the  eigenvalues  Ai,  A2  there  are  then  various  possible  cases 
to  consider.  These  are  listed  below  together  with  sketches  of  typical  pat- 
terns of  local  trajectories.  The  orientation  and  sense  of  rotation  in  these 
patterns  depends  on  the  system  concerned.  However,  in  each  case  the  di- 
rections of  the  arrows  indicate  evolution  with  time  t  along  the  trajectories. 


'0 

=  Cl 

1 

eAt  +  C2 
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Case  1 

Ai,A2  real,  unequal,  same  sign  =>  {improper)  node,  e.g.  negative  sign 
(Fig.  C.l). 


Fig.  C.l 

All  trajectories  except  for  one  pair  approach  the  critical  point  tangent 
to  the  same  line.  The  critical  point  is  asymptotically  stable. 

If  the  sign  of  Ai,  A2  is  positive  then  the  local  structure  is  similar  to  that 
above,  but  with  the  sense  of  the  arrows  reversed.  The  critical  point  is  then 
unstable. 

Case  2 

Ai ,  A2  real,  equal  or  unequal  magnitude,  opposite  sign  =>  saddle  (or 
hyperbolic  point)  (Fig.  C.2).  This  type  of  critical  point  is  always  unstable. 


Fig.  C.2 


Phase  Plane  Analysis  near  Critical  Points 


419 


Case  3 

Ai  =  A2  =  A  (necessarily  real). 

1.  When  M  =  XI  we  have  a  (proper)  node,  e.g.  A  negative  (Fig.  C.3).  This 


critical  point  is  asymptotically  stable. 


Fig.  C.3 

If  A  is  positive  then  the  local  structure  is  similar  to  that  above,  but  with 
the  sense  of  the  arrows  reversed.  The  critical  point  is  then  unstable. 
(A  proper  node  is  sometimes  called  a  star,  focus,  source  or  sink  as  ap- 
propriate.) 

2.  When  M  may  not  be  diagonalized,  so  that  there  is  only  a  single  eigen- 
vector corresponding  to  A,  we  have  an  improper  (or  inflected)  node,  e.g. 
A  negative  (Fig.  C.4).  This  critical  point  is  asymptotically  stable. 


Fig.  C.4 
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If  A  is  positive  then  the  local  structure  is  similar  to  that  above,  but  with 
the  sense  of  the  arrows  reversed.  The  critical  point  is  then  unstable. 

Case  4 

Ai;2  a  complex  conjugate  pair  ji  ±  iv,  with  /i  /  0  =>  spzra/  (point),  e.g. 
\i  negative  (Fig.  C.5). 


Fig.  C.5 

This  critical  point  is  asymptotically  stable. 

If  n  is  positive  then  the  local  structure  is  similar  to  that  above,  but  with 
the  sense  of  the  arrows  reversed.  The  critical  point  is  then  unstable. 

(A  spiral  is  sometimes  called  a  spiral  source  or  spiral  sink  as 
appropriate.) 

Case  5 

\±.2  a  pure  imaginary  conjugate  pair  ±iv  =>■  centre  (or  elliptic  point) 
(Fig.  C.6). 

The  sense  of  the  arrows  may  be  different,  but  this  type  of  critical  point 
is  always  stable. 

We  can  solve  equation  (C.4)  for  the  eigenvalues  Ai,  A2  in  terms  of  trM 
and  det  M  obtaining 

Ai,2  =  |(trM±\/A),  (C.9) 

where  the  discriminant  A  =  (trM)  —  4(det  M).  We  may  then  represent  the 
types  of  behaviour  in  the  phase  plane  near  a  critical  point  schematically. 
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Fig.  C.6 


(See  Fig.  C.7.)  Note  that  along  the  line  det  M  =  0  the  critical  point  is  not 
isolated. 


det  M 


4  =  0 


4<0 
Asymptotically 


4>fl 


stable  nodes 


stable 
•o  spirals 


4=0 


4>0 
Unstable 
s  nodes 


tr  M 


Saddles  4>0 


Fig.  C.7 


C.2    Almost  Linear  Systems 

We  have  seen  that  the  local  analysis  near  a  typical  critical  point  in  the 
phase  plane  leads  to  (13.14)  and  this  equation  differs  from  the  linear  system 
(C.l)  in  that  it  includes  some  higher-order  terms.  For  the  almost  linear 
system  (13.14)  the  classification  of  the  corresponding  linear  system  (C.l) 
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determines  the  local  phase  portrait  and  the  type  of  stability  in  almost  every 
case.  Small  changes  produced  by  the  higher-order  terms  are  evidently  going 
to  be  crucial,  if  at  all,  only  in  the  particular  cases: 


The  equal  real  eigenvalues  A,  A  (node)  could  split  to  give  A  ±  e  (node)  or 
A  ±  ie  (spiral),  where  e  is  small.  However,  the  stability  of  the  critical  point 
would  still  be  just  that  predicted  by  the  linear  system  analysis. 


The  pure  imaginary  conjugate  pair  of  eigenvalues  ±iv  (centre)  could  become 
±\(v  +  e)  (centre)  or  e  ±  if  (spiral),  where  e  is  small.  Naturally  a  centre 
would  still  indicate  that  the  critical  point  is  stable.  However,  the  spiral 
would  be  crucially  dependent  for  its  stability  on  the  sign  of  the  new  real 
part  e  of  the  eigenvalue  pair.  If  e  >  0  then  the  critical  point  is  unstable, 
whereas  if  e  <  0  then  the  critical  point  is  asymptotically  stable. 

So,  for  the  systems  we  are  considering,  it  is  only  when  the  exactly  linear 
analysis  of  §C.l  predicts  that  a  critical  point  is  a  centre  that  we  need  to  be 
suspicious  of  the  predictions  of  the  exactly  linear  analysis.  Whether  (13.14) 
has  a  true  centre  or  an  unstable  or  asymptotically  stable  spiral  has  to  be 
resolved  by  a  closer  scrutiny  of  the  particular  system  in  hand. 

It  is  the  case,  in  fact,  that  the  trajectories  near  a  critical  point  in  the 
phase  plane  have  a  topological  equivalence  in  the  linear  and  almost  linear- 
systems  except  when  there  is  a  zero  eigenvalue  (i.e.  the  critical  point  is  not 
isolated)  or  when  the  eigenvalues  are  pure  imaginary  (i.e.  a  centre)  —  this 
is  guaranteed  by  a  theorem  due  to  Hartman  and  Grobman.  For  example, 
we  can  examine  the  system 


which  has  only  one  critical  point  (at  the  origin  x  =  0,  y  =  0).  For  the  linear 
system,  in  the  expansion  about  the  origin  we  have 


Case  3 


Case  5 


dx 
dt 
dy 
dt 


y  +  x2 


(CIO) 


M  = 


1  0 
0  -1 
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so  that  the  eigenvalues  are  Ai 


1,A2  = 

T 

"0 

0 

1 

1  with  eigenvectors 


respectively.  The  trajectories  near  the  origin,  which  is  a  saddle,  are  indi- 
cated in  Fig.  C.8(a).  For  the  exact  nonlinear  system  (C.10)  we  can  write 


dx 


y 


+  X, 


(C.ll) 


so  that 


X  c 

 1 — ,        with  c  constant, 

3  x 


together  with  a  second  solution  x  =  0  (for  all  y). 

The  exact  family  of  trajectories  near  the  origin  is  indicated  in 
Fig.  C.8(b).  It  should  be  noted  that  the  trajectories  which  go  directly 
into  and  directly  out  of  the  critical  point  O  (respectively  the  stable  and  un- 
stable manifolds)  correspond  directly  at  and  near  O  for  the  exactly  linear 
and  almost  linear  systems  —  a  general  result  usually  known  as  the  stable 
manifold  theorem. 


Fig.  C.8 


C.3    Systems  of  Third  (and  Higher)  Order 

As  we  indicated  in  §13.6,  higher-order  systems  can  be  analyzed  in  a  similar 
fashion  to  that  carried  out  in  §13.3  and  earlier  in  this  appendix.  That  is  to 
say  the  critical  points  are  found  and  local  analysis  effected  about  each  of 
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them  by  linearization.  The  resulting  eigenvalue/eigenvector  problems  de- 
termine the  local  stability  of  the  critical  points  and  the  local  phase  portrait 
structures  normally  determine  the  global  phase  portrait  for  the  complete 
system. 

For  a  third-order  system,  e.g.  the  Lorenz  system  of  §13.6,  the  matrix  M 
for  a  particular  critical  point  is  3  x  3  [again  a  tensor  of  valence  2  (§A.9,10)], 
so  that  the  three  eigenvalues  satisfy  a  cubic  equation.  This  implies  that  at 
least  one  of  the  eigenvalues  must  be  real,  with  the  others  either  both  real 
or  a  complex  conjugate  pair.  We  only  note  here  that,  if  all  the  eigenval- 
ues are  negative  real  or  have  negative  real  part,  then  the  critical  point  is 
asymptotically  stable.  Even  if  only  one  of  the  eigenvalues  is  positive  or  if 
the  complex  conjugate  pair  has  positive  real  part,  then  the  critical  point  is 
unstable. 


Problems 

1.  Find  the  critical  points  of  the  following  systems  and  classify  them  ac- 
cording to  their  local  linear  approximations: 

(a)  x  =  -3x  +  y,  y  =  4x  -  2y; 

(b)  x  =  3x  +  y,  y  =  2x  +  2y; 

(c)  x  =  —6x  +  2xy  —  8,  y  =  y2  —  x2; 
(A)  x  =  —2x  —  y  +  2,  y  =  xy; 

(e)  x  =  4  —  Ax2  —  y2,  y  =  3xy; 

(f)  x  =  s'my,  y  =  x  +  x3; 

(g)  x  =  y,  y  =  ^-  i+~2v  j  x  in  the  cases  ui2  <  a  and  lj2  >  a. 

2.  For  the  nonlinear  oscillator  equation  x  +  x  =  x3,  write  x  =  y  and  show 
that  there  are  two  saddle  points  and  one  centre  in  the  linear  approxi- 
mation about  the  critical  points  in  the  (x,  y)  phase  plane.  Integrate  the 
equations  of  the  system  to  obtain  an  'energy'  equation  and  use  this  to 
show  that 

(a)  the  centre  is  a  true  centre  for  the  full  system; 

(b)  the  equation  of  the  separatrices  through  the  saddles  is 


2y2 


.2 


2)  +  1. 


Appendix  D 

Discrete  Dynamical  Systems  —  Maps 


In  this  appendix  we  consider  discrete  dynamical  systems  in  which  a  space 
is  effectively  mapped  onto  itself  repeatedly.  We  recognized  in  Chapter  13 
that  for  some  systems  it  is  appropriate  and  useful  to  observe  at  discrete 
time  intervals,  which  are  not  necessarily  equal  —  this  is,  for  example,  often 
the  case  for  biological  systems. 

Also  we  saw  in  §14.2  the  concept  of  a  Poincare  return  map,  where  the 
evolution  of  a  system  through  its  dynamics  induces  a  map  of  a  Poincare 
section  onto  itself.  Examining  properties  of  maps,  in  their  own  right,  will 
give  insight  into  mechanisms  of  chaotic  breakdown  in  continuous  systems 
as  well. 

D.l     One-dimensional  Maps 

We  consider  a  map  given  by 

xn+1  =  F(xn),  (D.l) 

for  n  =  0, 1,  2, . . .  and  with  F  a  known  function,  and  consider  possible  be- 
haviours of  xn  for  suitable  initial  values  xq  ,  as  we  iterate  to  find  successively 
xi  =  F(xq),  x2  =  F(x\)  =  F(F(xo))  =  F(2\xo),  etc.  We  can  expect  to 
find  any  fixed  points  X  as  solutions  of 

X  =  F{X).  (D.2) 

To  examine  the  stability  of  the  fixed  point  X,  we  may  write  xn  +  en  =  X, 
for  each  n,  and,  when  e„  is  small,  we  can  expand  F(xn)  in  (D.l)  in  the 
form 

F(xn)  =  F(X  -  c„)  =  F(X)  -  enF'(X)  +  \e2nF"{X)  +  ...,  (D.3) 
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where  F'(X)  =  [dF(x)/dx]x=x,  etc. 

A  fixed  point  X  is  asymptotically  stable  (and  therefore  an  attractor)  if 
\F'(X)\  <  1.  We  may  consider  different  cases  (where  e„  — >  0): 

•  0  <  \F'(X)\  <  1    =>    e„+i  ~  F'(X)en  as  n  — >  oo,  and  we  have 
first-order  convergence. 

•  F'(X)  =  0,  F"(X)  ^  0  =^>   e„+i  ~  -\F"(X)ei  as  n  ->  oo,  and  we 
have  second-order  convergence. 

While  this  sequence  may  be  continued,  the  key  criterion  is  that  stated  above 
for  \F'(X)\.  We  note  that  the  case  \F'(X)\  >  1  leads  to  instability  of  X, 
and  that  the  case  \F'(X)\  =  1  depends  more  specifically  on  the  function 
F(X). 

A  familiar  example  of  what  is  normally  second-order  convergence  is 
the  Newton-Raphson  iteration  process  to  find  roots  of  a  single  equation 
f(x)  =  0.  Here  xn+i  =  xn  —  f(xn)/f'{xn)  and  each  root  has  a  basin  of 
attraction,  so  that  we  can  find  all  the  roots  by  judicious  choices  of  xq. 

The  very  simplest  map  is  the  linear  map: 

xn+1  =  rxn,  (D.4) 

with  r  constant  (and,  say,  non-negative),  and  it  is  evident  that  xn  =  rnxo 
in  this  case.  Here  there  are  various  behaviours  depending  on  r: 

•  0  <  r  <  1:  xn  — >  0  for  all  a^o  [asymptotic  stability  of  X  =  0]. 

•  r  =  1:  xn  =  xq  for  all  xq  [steady  state]. 

•  r  >  1:  xn  =  Xq  exp(nlnr)  [exponential  growth]. 

If  this  were  a  biological  model,  of  e.g.  a  seasonal  breeding  population  xn, 
then  the  rate  constant  r  is  crucial  in  determining  the  fate  of  any  initial 
population  xq. 

The  logistic  map 

A  simple  nonlinear  map,  derived  from  (D.4),  is 

xn+1  =  rxn  -  sxl,  (D.5) 

which  is  called  the  logistic  map  and  has  apparent  similarity  with  the  logistic 
differential  equation  (13.3).  In  a  biological  context  the  rate  constant  r, 
quantifying  the  ability  of  the  population  to  reproduce,  is  balanced  by  the 
parameter  s,  which  quantifies  the  effect  of  overcrowding.  This  model  formed 
the  centrepiece  of  what  has  become  a  very  influential  paper  —  'Simple 
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mathematical  models  with  very  complicated  dynamics',  May,  Nature,  261, 
459-467,  1976. 

A  simple  scaling  xn  =  sxn/r  leads  to  xn+i  =  rx„(l  —  xn),  and  it  is 
evident  that  the  overbar  may  then  be  dropped,  in  order  to  find  the  map 

xn+i  =  rxn(l  -  xn),  (D.6) 

which  is  the  logistic  map  in  standard  form,  with  r  the  single  key  parameter. 

Naturally  the  primary  physical/biological  interest  is  in  the  case  where 
the  x  interval  [0, 1]  is  mapped  by  (D.6)  onto  [0, 1],  which  requires  0  <  r  <  4 
—  for  other  applications  this  r  restriction  might  well  be  absent. 

Despite  the  apparent  similarity  with  the  continuous  system  (13.3)  the 
maps  (D.5),  (D.6)  have  some  very  different  and  complex  properties. 

We  see  immediately  that  there  are  two  fixed  points  of  (D.6)  —  at  X  =  0, 
X  =  1  -  1/r  —  and,  in  each  case,  F'(X)  =  r(l  -  2X). 

It  is  helpful  for  maps  to  consider  [see  Fig.  D.l(a)]  a  plot  of  y  =  x  and 
y  =  F(x)  so  that  by  tracing  the  vertical  and  horizontal  lines  between  the 
two  we  can  follow  the  sequence  xo  — >  x\  — >  xn  — >  . . . ,  and  thus  see  whether 
or  not  it  might  converge. 


Fig.  D.l 

The  behaviours  of  the  map  (D.6)  for  different  values  of  r  can  be  sum- 
marized as  follows: 

•  0<r<l:X  =  0is  asymptotically  stable  and  X  =  I  —  1/r  is  unsta- 
ble. [X  =  0  is  a  point  attractor  —  if  the  corresponding  linear  model 
population  cannot  sustain  itself  then  overcrowding  makes  it  worse!] 

•  1  <  r  <  3:  X  =  0  is  unstable  and  X  =  1  —  1/r  is  asymptotically  stable 

-  see  Fig.  D.l  (a)  for  example.   [X  =  1  —  1/r  is  a  point  attractor  — 
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exponential  growth  is  stabilized  by  overcrowding,  in  very  similar  fashion 
to  the  behaviour  of  the  logistic  differential  equation  (13.3).] 
•  3  <  r  <  4:  X  =  0  and  X  =  1  —  1/r  are  now  both  unstable.  As  r 
increases  successive  'period- doubling  bifurcations'"  occur  as  asymptotic 
stability  is  exchanged  between  lower-  and  higher-order  cycles  (termed  a 
supercritical  flip  bifurcation): 

*3<r<  1  +  y/6  =  3.449  48- ••:  xn  — >  an  asymptotically  stable 
2-cycle  (or  period-2  solution).  If  we  examine  xn+2  =  F(xn+i)  = 
F^  (xn),  then  we  obtain  an  equation  of  degree  4  for  the  fixed  points 
of  this  iterated  mapping.  Of  course  X  =  0,  X  =  1  —  1/r  are  two 
of  the  roots  of  this  equation.  The  nontrivial  solutions  Xi,X2,  such 
that  F(Xi)  =  X2,  F(X2)  =  X1  [see  Fig.  D.l(b)],  are  roots  of  the 
quadratic  r2X2  —  r(r  +  1)X  +  (r  +  1)  =  0.  For  asymptotic  stability 
it  is  necessary  (see  Problem  I)  that  4  +  2r  —  r2  <  1  or  equivalently, 
for  positive  r,  3  <  r  <  1  + 

*  3.449 48-- •  <  r  <  3.54409- ••:  xn  ->  an  asymptotically  stable  4- 
cycle  found  from  an  equation  of  degree  16;  four  more  roots  are  the 
trivial  solutions  0, 1  —  1/r, X\,X2  (the  other  roots,  when  real,  give 
4-cycles  arising  through  a  different  process). 

*  3.544  09- •  <  r  <  3.564  40- ••:  xn  ->  an  asymptotically  stable  8- 
cycle,  and  so  on  (each  time  in  a  shorter  interval  in  r)  ...  until 

*  r  =  3.569  94  •  •  • :  Accumulation  point  of  2°°-cycle. 

*  3.569  94  •  •  •  <  r  <  4:  For  some  values  of  r  there  are  asymptotically 
stable  cycles  of  different  lengths,  but  for  others  the  xn  values  range 
seemingly  over  a  whole  continuous  interval,  in  an  apparently  random 
fashion.  An  intriguing  fact  is  that  odd-period  cycles  only  appear  for 
r  >  3.67857-". 

The  logistic  map  attractors  are  shown  in  Fig.  D.2.  The  numbers  along  the 
top  are  the  cycle  periods. 

It  should  be  noted  that  this  figure  has  an  approximate  self-similarity 
at  higher  magnification,  in  that  the  period- doubling  cascade  is  broadly  re- 
peated as  other-period  asymptotically  stable  cycles  become  unstable  —  e.g. 
the  period-3  cycle  in  Fig.  D.2. 

That  the  period-doubling  proceeds  broadly  in  geometric  fashion  in  the 
limit  was  discovered  by  Feigenbaum  in  1975.  Here  the  r-intervals  Aj  be- 
tween bifurcations  and  the  measures  di  of  width  in  X  of  successive  cycles 
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Odd  periods  begin 


Fig.  D.2 


are  such  that  (see  Figs.  D.2,  D.3) 


A./A, 
di/di. 


5  =  4.669  2016. 
a  =  2.502  9078. 


(D.7) 


in  each  case  in  the  limit  as  i  — >  oo. 

Feigenbaum  noted  that  essentially  all  'humped'  mapping  functions  F(x) 
lead  to  intersections  of  y  =  x  with  y  =  F(x),  y  =  F^(x),  y  =  F^lx), . . . 
which  are  similar  up  to  a  rescaling.  In  a  process  of  ' [renormalization'  the 
precise  form  of  F(x)  is  lost  and  in  1976  Feigenbaum  discovered  a  universal 
function  g(x),  which  is  self- reproducing  under  such  rescaling  and  iteration, 
so  describing  this  universal  property: 


-ag 


-x 
a 


(D.8) 


(see  Feigenbaum,  Journal  of  Statistical  Physics,  21,  669-706,  1979). 

The  universality  leads  to  5,  a  as  universal  constants  and  5  is  usually 
identified  as  the  Feigenbaum  number. 


430 


Classical  Mechanics 


Fig.  D.3 

As  to  the  onset  of  cycles  with  other  periods,  there  is  a  theorem,  due  to 
Sarkovskii  (1964),  which  is  as  follows: 

If  a  mapping  function  F(x)  has  a  point  xp  which  is  cyclic  of  order 
p,  then  it  must  also  have  a  point  xq  of  period  q,  for  every  q  which 
precedes  p  in  the  sequence: 

1  <=  2  <=  4  <=    •••    <=  2™  ^  ••• 


•••  <=2m.9<=2m.7<^2m.5<^2m.3 


■■■  <=  22.9  <=  22.7  <=  22.5  <=  22.3 
■■■  <=  2.9  <=  2.7  <=  2.5  <=  2.3 
...<=      9  <=      7  <=      5  <=  3. 

For  example,  the  existence  of  a  3-cycle  implies  the  existence  of  cycles  of  all 
the  other  periods! 

Naturally  the  odd  periods  cannot  arise  from  period-doubling,  but  do  so 
via  a  rather  different  process  which  may  be  examined  by  similar  methods 
to  those  employed  above. 

This  theorem  can  be  proved  using  a  continuity /intermediate  value  the- 
orem argument.  However  it  says  nothing  about  stability  of  these  cycles,  or 
the  ranges  of  r  (in  our  logistic  example)  for  which  they  may  be  observed. 
The  vast  majority  of  these  cycles  are  unstable  when  all  are  present,  and 
it  is  this  which  leads  to  a  brief  summary  statement  in  the  form  'Period  3 
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implies  chaos',  as  essentially  random  behaviour  of  iterates  xn  occurs  (Li 
and  Yorke,  American  Mathematical  Monthly,  82,  985-992,  1975,  in  which, 
incidentally,  the  term  chaos  was  first  introduced!). 

What  can  be  said  about  values  of  r  in  the  logistic  map  which  lead  to 
distributions  of  iterates,  rather  than  to  asymptotically  stable  cycles? 

Analytically  this  is  a  tough  problem.  However,  it  happens  that  there 
are  two  positive  values  of  r  for  which  a  formal  exact  solution  of  (D.6)  is 
known  —  r  =  2  and  r  =  4. 

The  former  (r  =  2)  allows  (1  —  2xn+i)  =  (1  —  2xn)2  leading  to  xn  = 
|[1  —  (1  —  2xo)2"}.  This  is  not  especially  interesting,  since  xn  — >  |  as  we 
should  expect. 

However  the  latter  possibility  (r  =  4)  allows  us  to  substitute  xn  = 
^[1  —  cos(2-7r#„)]  =  sin  (n9n)  and  this  leads  us  to  9n+\  =  29n  and  then 
9n  =  2™6*o.  Here  9  is  evidently  periodic,  with  period  1,  in  that  the  same 
x  is  generated  by  9  and  by  9  +  1.  Thus  we  may  write  any  9q  we  choose 
in  a  binary  representation  just  using  negative  powers  of  2  —  for  example, 
e0  =  l  +  l  +  ±  +  ±  +  ...  =  0.101 101 ... .  Then  6>i  =  0.011 01 . . . ,  92  = 
0.110  1 . . . ,  etc,  since  the  integer  part  may  be  cancelled  at  each  stage  on 
account  of  the  periodic  property. 

For  almost  all  choices  of  9q  then  the  9n  will  be  uniformly  distributed  on 
the  interval  [0, 1],  since  each  digit  in  the  binary  expansion  of  90  could  be 
chosen  with  equal  likelihood  to  be  a  0  or  a  1.  As  a  consequence  of  9  being 
uniformly  distributed,  then  x  is  not.  Indeed  x  has  a  probability  distribution 
given  by  P{x)  =  \/[ti^/x{1  -  xj]  (see  Figs.  D.4,  D.2). 


lP{x)  =  II  [7tVx(l-x)] 


0 


1 


X 


Fig.  D.4 
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We  note  that  P(x)  itself  is  the  attractor  here  for  r  =  4,  in  the  sense 
that  for  almost  all  choices  of  Xq  the  distribution  of  xn  will  approach  P{x) 
for  large  n.  It  is  called  an  invariant  probability  distribution. 

For  other  values  of  r  there  are  theoretical  results  for  the  correspond- 
ing probability  distribution  attractors  (via  an  equation  for  the  distribution 
function  —  Perron-Frobenius),  but  no  slick  result  like  that  above  for  r  =  4. 

The  distribution  attractors  are  also  characterized  by  an  exponential 
divergence  of  iterates,  leading  to  the  sensitivity  to  initial  conditions  charac- 
teristic of  chaos.  If  we  choose  to  examine  Xq  and  xq  +  eo,  with  eo  very  small, 
then  e„  ~  eo  exp(An)  on  average,  and  we  have  divergence  or  convergence  of 
iterates  according  as  A  >  0  or  A  <  0.  Here  A  is  a  Lyapunov  exponent  (see 


Values  of  A  can  be  found  by  numerical  computation.  In  measuring 
A  empirically  in  a  particular  practical  case,  we  allow  n  to  become  large 
enough  so  that  our  estimate  of  A  can  settle  down  to  a  steady  value.  We 
also  average  over  various  different  xo,  in  order  to  avoid  an  atypical  result 
through  a  single  unfortunate  choice.  For  the  logistic  map  Fig.  D.5  shows  a 
plot  of  A  as  r  varies. 

Here  F'(x)  =  r(l  —  2x),  so  that  we  expect: 

•  0  <  r  <  1:  A  =  lnr, 

•  1  <  r  <  3:  A  =  In  1 2  —  r  (leading  to  an  infinite  spike  when  r  =  2), 

•  3  <  r  <  1  +  y/6:  A  =  \  In  |r(l  -  2XX)\  +  \  In  |r(l  -  2AT2)|  [leading  to  an 
infinite  spike  when  X\  =  |  (requiring  r  =  1  +  y/5  =  3.236)], 

•  r  =  4:  since  P(x)  =  1/[tt^/x(1  —  x)]  so  we  have 


§13.7). 


Since  we  have  F^(xq  +  eo)  -  F^(xo)  ~  e0eAn,  then  we  have 


(D.9) 


A  = 


/  (ln|F'(a;)|)P(a;)da; 


f1  In  |4(1  -  2x)\ dx 


=  ln2  =  0.693147. . .. 


JO         TTy/xil  -  X) 


In  any  event,  the  regions  of  r  for  which  there  are  positive  values  of  A  indi- 
cate the  sensitivity  to  initial  conditions  at  these  r  values.  While  there  are 
other  measures  of  the  complexity /disorder  of  the  iterates  in  these  cases  — 
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Fig.  D.5  [Baker  and  Gollub  (Chaotic  Dynamics,  2nd  ed.,  Cambridge  University  Press, 
1996)] 

e.g.  the  system  'entropy'  —  these  will  not  be  pursued  further  here.  However, 
see  Problem  4  for  another  (and  simpler)  example  of  a  chaotic  map. 

The  logistic  map  (D.6),  together  with  other  similar  maps,  involves 
stretching  and  folding,  so  that  states  diverging  exponentially  are  still 
broadly  confined  to  a  bounded  region.  The  loss  of  information  about  initial 
conditions,  as  the  iteration  process  proceeds  in  a  chaotic  regime,  is  asso- 
ciated with  the  non-invertibility  of  the  mapping  function  F(x),  i.e.  while 
xn+i  is  uniquely  determined  from  xn,  each  xn  can  come  from  2  possible 
xn-\,  4  possible  s„_2,  etc,  and  eventually  from  2™  possible  x0.  Hence 
system  memory  of  initial  conditions  becomes  blurred! 

Many  continuous  systems  —  for  example  the  Lorenz  system  of  §13.6  — 
exhibit  a  similar  period-doubling  in  their  dynamics. 

D.2    Two-dimensional  Maps 

For  two-dimensional  maps  any  new  universality  has  proved  harder  to  find! 
However,  there  are  interesting  phenomena.  We  may  write  our  generic  map 
in  the  form 


yn+1  =  G(xn,yn), 


(D.10) 
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for  integer  n  and  with  F,  G  known  functions.  Again  we  may  seek  fixed 
points  (X,  Y)  as  solutions  of 

X  =  F(X,Y), 

V   '  D.ll 
Y  =  G(X,Y).  V  ' 

By  a  Taylor  expansion,  near  X,Y  and  similar  to  that  carried  out  in  §13.3 
and  in  Appendix  C,  we  obtain 

(d  F    QF  \ 
!!)„,■  (D-I2) 

It  now  follows  that  (xn,  yn)  — >  (X,  Y)  if  and  only  if  the  eigenvalues  of  the 
matrix  M  all  have  modulus  less  than  1;  this  is  needed  to  force  Mn  — >  the 
zero  matrix  as  n  — >  oo. 

When  instability  sets  in  as  parameters  are  changed,  we  then  typically 
have  a  2-cycle  to  examine  with  (X±,  Y\)  +±  (X2,  Y2)  and  where  these  points 
are  the  solutions  of 

X  =  F[F(X,Y),G(X,Y)}, 

Y  =  G[F(X,Y),G(X,Y)],  "  ' 

other  than  the  fixed  points  of  (D.10)  found  earlier  and  which  satisfy  (D.ll). 
The  asymptotic  stability  of  this  2-cycle  depends  on  the  eigenvalues  of  the 
matrix  product 

' 8F   8F\  /OF  dF\ 


dG  dG  \9G9G 

dx    dy/  (x,y)=(Xi,Yi)  (x,y)=(X2,Y2) 


and  so  on. 


The  Henon  map 

Probably  the  most  celebrated  example  is  the  Henon  map  (Henon,  Commu- 
nications in  Mathematical  Physics,  50,  69-77,  1976): 

xn+1  =  1  -  ax2n  +  yn, 

(D.15) 

yn+i  =  bxn, 

with  a,  b  real  parameters,  and  where  the  normal  physical  interest  is  in  |fe|  < 
1.  This  map  was  constructed  to  exhibit  some  behaviours  similar  to  those 
of  the  Lorenz  system  of  §13.6.  Geometrically  the  map  may  be  considered 
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to  be  a  composition  of  three  separate  simple  maps  —  an  area-preserving 
fold,  a  contraction  \b\  in  the  x  direction,  an  area-preserving  rotation. 
There  are  two  fixed  points  given  by 

X=X±  =  [-(l-b)±  V(l  "  b)2  +  M/2a, 
Y  =  Y±  =  bX±, 

and  these  are  real  and  distinct  if  and  only  if  a  >  ao  =  —  j(l  —  b)2.  One 
of  the  fixed  points  is  then  always  unstable  and  the  other  is  asymptotically 
stable  if  a  <  «i  =  |(1  —  b)2  (see  Problem  7).  For  a  >  oi,  both  fixed  points 
are  unstable  and  we  get  period-doubling  to  a  2-cycle,  4-cycle,  etc.  The 
two-cycle  stability  is  determined  through  (D.14)  by  the  eigenvalues  of  the 
matrix  product 

MlM2=(    -2abX2  b  J' 

This  leads  to  asymptotic  stability  of  the  2-cycle  only  when  a\  <  a  <  a2 
with  a-i  =  (1  —  b)2  +  j(l  +  b)2 .  The  period-doubling  cascade  then  continues 
(see  Fig.  D.6). 


b 

a0 

ai 

a(T) 

a{*) 

0 

-0.25 

0.75 

-1.40 

0.3 

-0.1225 

0.3675 

-1.06 

1.40 

b=0  corresponds  to  the  one-dimensional  logistic  map  (see  Problem  2) 


Fig.  D.6 
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Beyond  the  cascade,  and  embedded  among  other-period  cycles,  there 
are  (a,b)  values  where  the  attractor  is  very  complex  (see,  e.g.,  Fig.  D.7, 
where  a  =  1.4,6=  0.3). 


Fig.  D.7 

Here  there  is  stretching  along  the  strands  of  the  attractor  and  squeezing 
across  them  —  with  associated  Lyapunov  exponents  which  are  respectively 
positive  (along)  and  negative  (across)  (see  Problem  8). 

This  chaotic  attractor  —  termed  1  strange'  —  has  some  similar  features 
to  the  Lorenz  attractor  of  Fig.  13.20.  It  also  has  a  fractal  character  in  that 
it  has  a  broad  similarity  in  features  and  relative  scale  at  all  magnifications. 

The  repetition  of  stretching,  squeezing  and  folding  onto  the  original 
region  is  characteristic  of  a  technical  construct  —  the  Smale  'horseshoe' 
(1960)  —  which  is  now  known  to  be  a  trademark  of  chaotic  systems. 

Smale  'stretched'  the  unit  square  in  one  co-ordinate  direction  and 
'squeezed'  it  in  the  other  direction,  then  placing  the  resulting  strip  (with 
one  'fold')  over  the  original  square  —  with  inevitable  overlap.  Infinite 
repetition  of  this  sequence  of  operations  leads  to  the  identification  of  an  at- 
tractor consisting  of  all  the  points  which  remain  within  the  original  square 
indefinitely.  This  attractor  has  great  topological  complexity  and,  while  it 
guarantees  the  'sensitivity  to  initial  conditions'  of  chaos,  proving  its  exis- 
tence for  a  particular  system  may  be  very  tough. 
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A  formal  demonstration  of  chaotic  dynamics  for  the  Henon  map  is  con- 
tained in  a  paper  by  Benedicks  and  Carleson  (  Annals  of  Mathematics,  133, 
73-169,  1991). 

In  passing  (and  for  reference  in  §D.3)  we  note  that  a  map  which  pre- 
serves area  would  have  |  detM|  =  1  in  (D.12)  and  then  the  stretching  and 
squeezing  exactly  compensate  each  other  (see  Problem  11).  For  the  Henon 
map  this  is  the  case  only  when  |6|  =  1.  Equal-area  maps  are  of  special 
interest  since  the  Poincare  return  map  for  a  section  through  a  Hamiltonian 
system  (see  §14.2)  has  the  equal-area  property.  Some  consequences  are 
explored  briefly  in  §D.3. 


D.3    Twist  Maps  and  Torus  Breakdown 

In  §14.1  we  noted  that  each  trajectory  of  an  integrable  Hamiltonian  system 
with  n  degrees  of  freedom  is  confined  to  the  surface  of  an  n-torus  in  the  2n- 
dimensional  phase  space.  The  n-tori  corresponding  to  the  range  of  initial 
conditions  are  'nested'  in  the  phase  space. 

In  §14.2  we  introduced  the  concept  of  a  Poincare  surface  of  section,  as 
a  slice  through  the  dynamical  structure.  For  n  =  2  degrees  of  freedom  this 
section  of  the  nested  torus  structure  is  a  continuum  of  closed  curves,  each 
one  of  which  is  intersected  by  one  of  its  own  torus  trajectories  in  a  sequence 
of  points  (see  Fig.  14.4). 

If  we  make  use  of  action/angle  variables,  as  described  in  §14.3  and  par- 
ticularly in  the  polar  form  of  Fig.  14.5(c),  then  the  closed  curves  of  the 
Poincare  section  can  be  taken  as  concentric  circles.  The  intersection  points 
of  a  trajectory  with  its  own  particular  one  of  these  circles  (of  radius  r) 
will  necessarily  be  twisted  successively  around  the  origin  (the  centre  of  the 
circle)  through  an  angle  2ira.  Here  a  is  the  rotation  number,  which  is  the 
ratio  of  normal  frequencies  characterizing  the  particular  torus  concerned  — 
it  therefore  depends  on  r. 

We  can  then  introduce  the  notion  of  a  twist  map,  T  [Moser  (1973)];  in 
polars: 


rn+i\  =  rn 
en+1J  ~  \en  +  27va(rn) 


(D.17) 


When  a  perturbation  Hamiltonian  is  introduced,  as  in  §14.6,  we  can 
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model  the  modified  situation  using  a  perturbed  twist  map,  Te: 

( rn+A  =  (  r™  +  £/(r«'  6>") 
\8n+iJ      \6n  +  2na(rn)  +  eg(r 

with  e  small  and  positive  and  with  /,  g  known  smooth  functions. 

We  can  now  ask  what  happens  to  a  circle  of  a  particular  radius,  which 
is  mapped  to  itself  (with  a  twist)  by  T,  as  the  perturbation  quantified 
by  e  in  Te  grows  from  zero  —  i.e.  as  the  integrable  system  T  becomes  a 
near-integrable  system  Te,  in  such  a  way  that  area  is  still  preserved. 

In  fact  the  answer  to  this  question  depends  on  the  rotation  number  a 
corresponding  to  our  particular  circle  chosen. 

If  a  is  an  irrational  number  then,  for  e  small  enough,  the  circle  undergoes 
some  distortion  (perturbation)  but  is  certainly  not  destroyed.  This  is  also 
true  for  the  corresponding  torus  in  the  full  phase  space.  This  result  is  in 
accord  with  the  KAM  theory  referred  to  in  §14.6. 

As  we  shall  see,  it  is  circles  for  (D.17),  and  their  tori,  corresponding  to 
rational  a,  which  break  down  under  perturbation,  leading  to  sensitivity  to 
initial  conditions  and  chaos. 

The  rational  a  are  'scanty,  but  dense'  among  the  real  numbers  (see 
Problem  12)  and  these  a  correspond  to  resonances  in  the  system.  As  in  the 
discussion  of  the  problem  of  small  denominators  in  §14.6,  the  sensitivity  to 
initial  conditions  is  strongest  for  the  rational  a  =  k/s  with  small  values 
of  s.  As  the  perturbation  grows  with  e,  the  breakdown  associated  with 
each  such  rational  a  broadens,  so  that  progressive  overlap  occurs,  leading 
eventually  to  complete  breakdown  of  the  torus  structure. 

Let  us  examine  the  twist  map  (D.17)  for  three  neighbouring  circles 
C_,C,  C+  corresponding  respectively  to  a_  <  k/s,  a  =  k/s,  a+  >  k/s 
(say),  with  k,  s  positive  integers  and  with  q_,  a+  irrational.  (We  have  here 
chosen  to  take  a  to  be  an  increasing  function  of  radius  r.) 

Then  applying  the  twist  map  T  successively  s  times  we  find  that  the 
effect  of  TM  (see  Fig.  D.8)  is  to  map  the  circles  to  themselves,  with  C 
invariant  and  with  C_,  C+  twisted  'rigidly'  clockwise,  anticlockwise  respec- 
tively —  this  is  so  since  27rsa_  <  2nk  and  2ivsa+  >  2nk.  All  points  of  C 
are  fixed  under  the  iterated  mapping 

Since  a_,  a+  are  irrational  the  circles  C_,  C+  are  only  mildly  distorted 
when  we  apply  the  map  Te  successively  s  times  instead  —  i.e.  when  we 
iterate  to  consider  the  effect  of  TeS\  However,  the  inner  and  outer  cir- 
cles C-,C+  are  still  twisted  clockwise,  anticlockwise  respectively  —  see 
Fig.  D.9. 
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Fig.  D.8 


Fig.  D.9 


Moving  outwards  radially  from  O  in  any  direction  we  can,  by  continuity, 
find  a  point  on  that  radius  arm  which  undergoes  no  net  twist  when  the 
iterated  map  is  applied  —  P  is  mapped  only  radially  (to  Q)  by  this 
process.  By  considering  all  possible  radius  arms  we  construct  the  closed 
curve  C  consisting  of  all  such  points  P  which  are  mapped  only  radially  by 
Te(s)  —  to  C. 

For  the  map  Te  to  model  a  section  of  a  Hamiltonian  dynamical  system 
(as,  of  course,  does  T  itself)  the  map  Te  —  and  hence  Te^  —  must  be  one 
of  equal  reflected  in  §14.2. 
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Since  the  areas  contained  within  the  curves  C  and  C  must  be  equal, 
there  is  in  general  an  even  number  of  intersections  of  these  curves,  which 

(s) 

correspond,  of  course,  to  fixed  points  of  the  iterated  map  Tev  '.  These 
points  are  all  that  remains  fixed  from  the  original  invariant  circle  C  of  the 
unperturbed  iterated  map  rw.  We  now  note  that  the  fixed  points  are  of 
alternating  type  as  we  move  around  C  (or  C)  —  see  Fig.  D.10,  where  the 
iterated  mapping  sense  of  flow  is  indicated. 


(a)  Elliptic  point  (E)  stable         (b)  Hyperbolic  point  (H)  unstable 

Fig.  D.10 


There  are  evidently  in  all  2ns  such  fixed  points  of  the  iterated  map  T}s> , 
where  n  is  a  positive  integer  (usually  1). 

The  statement  of  existence  of  these  fixed  points,  of  their  multiplicity 
and  their  alternating  stability  is  a  result  known  as  the  Poincare-Birkhoff 
Theorem  (1927). 

It  turns  out  that  the  elliptic  fixed  points  E  are  themselves  surrounded 
(at  higher  scales)  by  elliptic  and  hyperbolic  fixed  points  corresponding  to 
even  higher-order  frequency  resonances. 

For  the  hyperbolic  fixed  points  H  the  unstable  and  stable  manifolds 
(q.v.  also  in  §C.2)  for  neighbouring  such  points  in  the  same  family  cross  to 
form  what  are  called  homoclinic  intersections.  The  resulting  instability  at 
all  scales  leads  inevitably  to  sensitivity  to  initial  conditions,  in  that  trajec- 
tories of  the  system  have  to  twist  and  turn,  forming  a  homoclinic  tangle, 
in  order  not  to  se//-intersect,  while  maintaining  the  equal-area  property  of 
the  return  map.  This  results  in  the  stretching,  squeezing  and  folding  asso- 
ciated with  the  Smale  horseshoe  referred  to  in  §D.2  and  this  forces  a  dense 
interweaving  of  ordered  and  chaotic  motions  within  Hamiltonian  systems 
(Fig.  D.ll).  Recognition  by  Poincare  of  the  existence  of  such  tangles  was 
the  first  mathematical  realization  of  the  presence  of  what  we  now  call  chaos. 
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KAM  torus 


0  Elliptic  point  E 


•  Hyperbolic  point  H 


Schematic  chaotic 
breakdown 


Fig.  D.ll 


An  example  of  a  map  in  which  some  of  the  breakdown  to  chaos  in  a 
Poincare  section  is  apparent  is  that  for  the  oval  billiard  map  (Fig.  14.13). 

For  further  exploration  of  breakdown  to  chaos  under  parameter  change, 
see  Problem  13. 


Problems 

1.  For  the  logistic  map  (D.6)  with  r  non-negative: 

(a)  Show  that  there  is  a  point  attractor  for  0  <  r  <  3. 

(b)  Show  that  there  is  a  two-cycle  attractor  for3<r<l  +  i/6. 

(c)  Show  that  a,  b,  s  can  be  found  such  that  yn  satisfies  the  logistic  map 
with  parameter  s      r)  and  xn  =  a  +  byn. 

(d)  Hence  determine  the  principal  period-doubling  bifurcation  points  for 
the  logistic  map  on  the  range  —  2  <  r  <  4. 

2.  Show  that  the  map  yn+i  =  1  —  ayn  is  just  the  logistic  map  (D.6)  for  xn 
with  xn,  yn  related  linearly  by  yn  =  a  +  f3xn  and  a  =  jr(r  —  2).  [This  is 
an  example  of  the  fact  that  all  quadratic  maps  yn+i  =  A  +  Byn  +  Cy\ 
are  essentially  just  the  logistic  map,  in  that  (D.6)  can  be  obtained  by  a 
suitably  chosen  linear  relation  between  yn  and  xn.] 

3.  *  Allowing  r  and  xn  to  be  complex  in  the  logistic  map  (D.6),  find  regions 
of  the  complex  r  plane  for  which  the  map  has  (a)  a  point  attractor,  (b) 
a  2-cycle  attractor.  (c)  Sketch  the  corresponding  regions  when  these 
results  are  expressed  in  terms  of  the  complex  a  plane  for  the  yn  map  of 
Problem  2. 
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4.  The  tent  map 


2xn,  (0<xn<  \) 

2(1  -  xn),     (|  <  Xn  <  1) 


has  unstable  fixed  points.  Show  that  this  map  exhibits  extreme  sen- 
sitivity to  initial  conditions,  in  that  an  uncertainty  eo  in  xo  is  rapidly 
magnified.  Estimate  the  number  of  iterations  after  which  the  range  of 
uncertainty  in  the  iterates  is  the  complete  interval  [0, 1]. 

5.  For  the  cubic  map  xn+i  =  axn  —  x^,  where  a  is  real,  show  that,  when 
\a\  <  1,  there  is  an  asymptotically  stable  fixed  point  X  =  0  and  that, 
when  1  <  a  <  2  there  are  two  such  fixed  points  at  X  =  ±^/a  —  1.  What 
happens  when  a  becomes  >  2? 

6.  Explore  the  one-dimensional  maps  of  Problems  1-5  using  a  pro- 
grammable calculator  or  (better)  a  computer.  Further  interesting  ex- 
amples are  xn+i  =  exp[a(l  —  xn)],  xn+i  =  asin:r„. 

7.  For  the  Henon  map  (D.15)  show  that: 

(a)  when  — 1(1  —  b)2  <  a  <  |(1  —  b)2  there  are  two  real  fixed  points, 
one  of  which  is  asymptotically  stable, 

(b)  *when§(l-6)2  <a<  (1  -  b)2  +  ±(1  +  b)2  there  is  an  asymptotically 
stable  2-cycle. 

[Hint:   Since  M  =  (    ^  the  eigenvalues  of  M  are  Ai,A2  with 


b  0, 

A1A2  =  —b,  Ai  +  A2  =  —  2aX.  To  determine  stability,  it  is  useful  to 
consider  a  sketch  of  the  function  /(A)  =  b/X  —  X  and  look  for  points 
where  /(A)  =  2aX  in  order  to  find  (a,  b)  such  that  both  the  eigenvalues 
satisfy  |A2|  <  1.] 

8.  *Show  that  a  small  circle  of  radius  e  centred  at  any  (X,  V)  becomes  a 
small  ellipse  under  a  single  iteration  of  the  Henon  map  (D.15).  Explain 
how  the  semi-axes  of  the  ellipse  are  related  to  Lyapunov  exponents  Ai,  A2 
and  show  that  Ai  +  A2  =  In  |6|  with  Ai  >  0  >  A2,  implying  simultaneous 
'stretch'  and  'squeeze'. 

9.  *The  Lozi  map  is  (D.10)  with  F(x,  y)  =  l+y  —  a\x\,  G(x,  y)  =  bx,  where 
a,  b  are  real  parameters. 

(a)  When  |6|  <  1  and  \a\  <  1  —  b,  show  that  there  is  one  asymptotically 
stable  fixed  point. 

(b)  Find  the  2-cycle  when  |6|  <  1  and  a  >  1  —  b  and  determine  its 
stability. 

10.  Explore  the  two-dimensional  maps  of  Problems  7-9  using  a  computer. 
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11.  By  calculating  Lyapunov  exponents  examine  sensitivity  to  initial  condi- 
tions of  the  equal-area  maps  of  the  unit  square  (0  <  x,  y  <  1): 

(a)  Arnold's  cat  map  xn+i  =  xn  +  yn,  yn+i  =  xn  +  1yn  (each  modulo 


12.  Consider  a  system  of  two  degrees  of  freedom  with  two  natural  frequencies 
wi,oj2  in  the  light  of  the  discussion  of  periodicity  and  degeneracy  in 
§§14.1,14.6  and  the  real  ratio  aji/u>2-  Show  that  for  numbers  on  the  real 
line: 

(a)  between  any  two  irrationals  we  can  certainly  find  a  rational; 

(b)  between  any  two  rationals  we  can  certainly  find  an  irrational. 

(Note  that,  despite  (a),  there  are  vastly  more  irrationals  than  rationals 
—  unlike  the  former  the  latter  are  'countable',  so  that  the  rationals  are 
'scanty,  but  dense'.  This  emphasizes  that  for  most  systems  periodic- 
ity (closure)  is  relatively  rare,  although  it  is  still  the  case  that  for  any 
irrational  W1/0J2  there  are  rationals  arbitrarily  close  by.) 

13.  *As  an  exercise  on  near-integrable  systems  explore  the  'standard  map' 
(Chirikov-Taylor)  analytically/computationally: 


(Note:  This  equal-area  map  models  the  twist  around  a  Poincare  section 
by  the  dynamics  (as  in  §14.2  and  in  more  detail  in  §D.3),  when  the 
integrable  case  (K  =  0)  is  expressed  in  action/angle  variables  (I,4>).  As 
K  increases  there  appear  resonance  zones,  periodic  orbits  and  bands  of 
chaos  as  tori  I  =  constant  undergo  progressive  breakdown.  See  Chirikov, 
Physics  Reports,  52,  263-379,  1979.) 


(2xn,  \yn)  (0  <  xn  <  §), 

(2arn-l,i(I/n  +  l))  {\<xn<l). 


J„+i  =  /„  +  Ksmcfi,, 

4>n+\  =  0n  +  In+1- 
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CHAPTER  1 

1.  niA/ms  =  3;  3v/4. 

2.  mi/m2  =  r2/r1. 

3.  2mr  =  F  =  F21  +  F31;  r  =  (m2r2  +  mzr-i)/(m2  +  m3). 

4.  0.12m. 

5-  r'i:j  =  nj,  p'i=Pi-  mlV,  F'i:j  =  Fir 

6.  400 N,  300 N. 

7.  arcsinO.135  =  7.76°  E  of  N;  60.6min;  130  km,  8.62°Wof  S. 

8.  1.2m;  (±0.4,0.6,-1.2),  (0,-0.9,-1.2);  7N,  7N,  ION. 
10.  (2.2  x  10"3)0  =  7.9". 

CHAPTER  2 

1.  (x  in  m,  t  in  s)  x  =  -3cos2t  +  4sin2t  =  5cos(2£  -  2.214)  = 
Re[(-3  -  4i)e2"];  t  =  0.322s,  1.107s. 

2.  2  =  —(mg/k)(l  —  cosuit),  oj  =  y/k/m. 

3.  0.447  s,  14.2  mm. 

4.  15.7°  s"1,  6  =  5°cos7rf  -  8.66°  sin vrf  =  10°  cos(7rf  +  tt/3). 

5.  V  =  -GMm/x;  ^2GM(R~1  -  g-1);  Skins"1. 

6.  V  =  jcx4;  y/c/2ma2;  x  =  ±o. 


7.  V  =  ^kx2  —  c\nx;  x  =  \fcfk\  to  =  \j2kjm. 

8.  F  =  —mk,  F  =  mk;  oscillation;  2yj2a/k. 

9.  F  =  kx,  \x\  <  a;  F  =  0,  \x\  >  a;  oscillation  between  two  turning 
points  if  k  <  0  and  S  <  0,  1  turning  point  if  k  >  0  and  E  <  ^ka2, 
otherwise  no  turning  points. 

10.  Earlier  by  (2a/v)  —  (2/u>)  arctan(cija/u). 

11.  (a)  no  turning  points,  (b)  1  turning  point,  (c)  1  or  2  turning  points. 
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12.  x  =  —a;  2tt y '2ma? / 'c;  (a)  |v|  <  ^/ c/ma,  (b)  w  <  —y/c/ma  or 
y/c/ma  <  v  <  y/2c/ma,  (c)  w  >  y/2c/ma. 

13.  z  =  (5/72)(l  -  e-T*)  -  fft/7;  i  -»  -ff/7. 

14.  8.05  s,  202  m. 

15.  Vflfcarctan(1/fc75M):  (1/2/c)  ln(l  + /cu2 /#). 

16.  VflAr  (fffc)"1/2ln(efe'1  -  Ve2kh  -  1). 

17.  ±-s/2(g/l)(cos6  -  cos0());  2-Kyflfg;  9  =  90  cos(^gjlt). 

18.  For  6»  =  7T  -  a,  a  =  (g/l)a;  0.95s;  27TS-1. 

19.  y/c/m(a2  —  x2);  x  =  a ta,nh(yj c/ mat). 

20.  x  =  a;  ir/uj;  y/a2  +  v2/Au>2  ±  w/2cj. 

21.  2  =  (mg/fc)[-l  +  (1  +  7t)e"7t],  7  =  ^fc/m;  16mm. 

22.  1.006s;  5.33°,  1.17°. 

23.  x  =  (v/w)e_7t  smu>t  — >  vte~l1. 

25.  wi  =         +  72  ±  7- 

26.  £  =  \ma\(ul  +  uj2),  W  =  2irm'yw1a\. 

27.  3;  final  velocities:  -6ms"1,  7ms"1,  10ms"1;  T  =  364.5  J. 

28.  vn  =  eny/2gh. 

29.  an  =  c/rnuj2n(l  +  n2). 

31.  t  =  1.017  x  2-Ky/TJg. 

32.  G(i)  =  (e"7-*  -  e-7+*)/m(7+  -  7_),  t  >  0; 
CHAPTER  3 

1.  (a)  V  =  —  \ax2  —  ayz  —  bxy2  —  |&23,  (c)  V  =  —  ar 2  sin  9  sin  ip, 
{i)V  =  -\(a-r)2. 

2.  ifl+fe. 

3.  (i)  0,  (ii)  \a. 

4.  (a)  Tva2,  F  not  conservative;  (b)  0,  F  may  be  conservative. 

5.  F  =  c[3(fc  •  r)r  -  r2k]/r5;  Fr  =  2ccos6/r3,  Fg  =  csinO/r3,  Fv  =  0. 

6.  382m,  883m;  30°;  17.7s,  10.2s. 

7.  2  =  2;  tan  a  -  gx2/2v2  cos2  a;  a  =  it /4  +  /J/2. 

8.  2  =  wx/u  -  gx2/2u2  -  7gir3/3w3;  42.3°,  823m. 

9.  6.89  km,  7.35  km,  7.18  km. 

10.  Auj;  mujH^/r3;  AT  =  \muj2l2 . 

11.  w/2,      =  4fca2/3m;  4fca/3m,  -5fca/6m. 

12.  9  =  2F/ma-  {g/a)sin9,  92  =  AFO/ma  -  (2g/a){\  -  cosff); 
F0  =  0.362  rag. 
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13.  9  =  (1  -sin%/3a,  02  =  (0  -  1  +  cos0)2fl/3a;  F  =  mg{l  +  2  sin  60/6; 
9  =  7tt/6;  thereafter  the  two  bodies  move  independently  until  string 
tautens. 

14.  u  =  (M  +  m)gsma/(M  +  msin2  a), 

v  =  mg  sin  a  cos  a/ (M  +  msin2  a);  a  =  arcsin(2/3)  =  41.8°. 

15.  2  = 


'  sin 


§  sin  20). 


17.  cot  9  =  cot  0o  cos(ip  —  ifo),  (0o,  <fio  constants). 


18. 


4 


Hi-i)' 


Ff 


rsin2  0  ip2)er  + 


19.  (a)  and  (b):  r  =  k/(c  +  g),  0  =  0,  unstable;  (a)  only:  r  =  k/(c  —  g), 
9  =  ty,  stable. 

20.  T  =  \mY,ih2q2,  Pi  =  mh2qi,  et  ■  p  =  mh^i. 

21.  r  =  (p  -  ptp2)ep  +  (pip  +  2pip)ev  +  zk  =  (f  -  r92 
(r0  +  If  9  —  r  sin  0  cos  9  ip2)eg  +  (rsmOtp  +  2r  cos  0  9(p  +  2r  sin0  <p)ev. 

23.  der/d9  =  eg,deg/d9  =  —er,  der/d(p  =  ev  sin0,  deg/dtp  =  ev  cos0, 
dev/dip  =  —  (er  sin0  +  eg  cos0),  others  zero. 

24.  mc2  [(cosh2  A  -  cos20)A+  |sinh2A(A2  -  02)  +  sin  20  A0]  =  F\, 
mc2  [(cosh2  A  -  cos2  0)0  -  \  sin20(A2  -  02)  +  sinh2A  A0]  =  Fg. 

25.  y  =  A  +  acosh[(a;  —  b)/a],  a,  b,  A  constants. 

26.  A  circle,  x2  +  y2  —  2by  =  a2,  b  constant. 
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1.  4.22  x  104km. 

2.  1.61  x  105  km,  0.176  au  =  2.64  x  107  km. 

3.  11.9yrs,  13.1kms_1. 

4.  5.46  yrs. 

5.  38.6kms-\  7.4kms"1. 

6.  1.62ms-2,  2.38kms_1;  25.8ms-2,  60.2kms_1. 

7.  84.4  min,  108  min,  173  min. 

8.  1.0  x  1011  Ms  (assuming  that  the  mass  distribution  is  spherical  —  this 
is  the  mass  inside  the  radius  of  the  Sun's  orbit). 

9.  y/k/ma;  r2  =  |a2(3  +  2  cos  a  ±  V5  +  4cosa);  r  =  2a,  a;  r  =  a,  0. 

10.  U  =  J2/2mr2  +  \k{r  -  a)2;  to  =  ^/TJ2uj0;  J  =  yfhj2uj0;  2.24  radial 
oscillations  per  orbit. 

11.  (2/3vr)  yrs  =  77.5  days. 

12.  Ratio  is  1.013,  2.33  134.  

13.  1/r2  =  mE/J2  +  ^/(mE/J2)2  -  mk/J2  cos  2(0  -  0„). 


448 


Classical  Mechanics 


14.  Hyperbola  with  origin  at  the  centre. 

15.  7.77  days. 

16.  8.8  kms"1,  5.7kms_1;  97°  ahead  of  Earth;  82°  ahead  of  Jupiter. 

17.  4.26i?E,  18.4°;  38.3kms-\  4.87 yrs. 

19.  GMm/2a,-GMm/a. 

20.  x  =  a(e  —  cosh         =  bsinhip;  r  =  a(ecoshtjj  —  1), 
t  =  (abm/  J)  (esinhtp  —  ip). 

21.  5.7kms~1,  opposite  to  Jupiter's  orbital  motion;  5.7kms_1; 
3.9  x  106km  =  56J?j,  23Rj. 

22.  14. Skins"1  at  23.5°  to  Jupiter's  orbital  direction;  9.2AU,  16.2yrs: 
3.6  AU. 

23.  14.3  kms"1,  in  plane  normal  to  Jupiter's  orbit,  at  23.5°  to  orbital 
direction;  7.8  AU,  16.2  yrs;  2.5  AU. 

24.  cos 9  =  (1  -  l/R)/y/l-  I /a;  60°,  6.45 kins"1. 

25.  With  n2  =  |1  +  mk/J%  b2  =  J2/2rn\E\: 
J2  +  ink  >  0,  E  >  0  :  r  cos  n{6  -  60)  =  b; 
J2+mk  =  0,E>0:r(6-  60)  =  ±b; 

J2  +mk  <  0,E  >  0  :  rsinhn(6»  -  60)  =  ±b; 

J2  +  rnk  <  0,  E  =  0  :  re±n6  =  r0; 

J2  +mk  <  0,E  <  0  :  rcoshn(6»  -  0O)  =  b. 

26.  do-/dn  =  kTT2(iv  -  e)/mv262(2Tv  -  6>)2  sin6>. 

27.  u>  =  \/(— ka2  —  c)/ma5,  u>'  =  \J (—ka2  +  c)/ma5. 

28.  0.123m,  2.44m. 

29.  1.13  x  10-nm,  8.1  x  lO^"1. 

30.  r  =  eJsm9/ml,  r  =  eJ2  cos 9/m2lr2,  rad.  accel.  =  —J2/m2lr2. 
CHAPTER  5 

1.  2.2ms"2  =  0.086 c/j,  5.1  x  lO^ms"2  =  1.9  x  lO"5^- 

2.  15.3s"1. 

3.  0.20mm,  78  mm. 

4.  465ms"1;  (a)  542ms"1,  (b)  187ms"1,  (c)  743ms"1. 

5.  (a)  99.88  twt,  (b)  100.29 twt,  (c)  99.46  twt. 

6.  2.53  x  10"3N  to  south,  1.46  x  10"3N  up. 

7.  O-OWmbarkm-1. 

8.  0.155°. 

9.  12.0  s;  lOOkgwt,  20kgwt;  decreasing  weight  and  a  Coriolis  force  of 
79  N. 

10.  47.4mm. 
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11.  F  =  mr  =  q(E  +  v  A  B)  with  E  =  Ek,  B  =  Bk; 

x  =  (rnv/qB)sm(qBt/m),y  =  (mv/qB)[cos(qBt/m)  —  1], 
2  =  qEt2 /2m;  z  =  (2mE/qa2B2)y2;  depends  only  on  m/q. 

12.  I  =  nmv/qB;  E  =  2x  106  Vm"1,  I  =  0.089m. 

13.  ~  105T,  1.76  x  lO1^"1. 

14.  5.1  x  1016s_1,  -3.3  x  1016s_1. 

15.  124m. 

18.  mr  =  F  —  ma. 


X* 

COSUlt 

—  sin  Lot  0 

X 

19. 

y* 

sixiuit 

cos  Ult  0 

y 

z* 

0 

o  i 

z 

20.  T  =  \m{r  +  oj  f\r)2. 
CHAPTER  6 

1.0=  (q/2ire0a2)(Va^T^  -\z\),E=  (qk/2ne0a2)        -  -rf-p) : 
E  ^  k(a/2e0)(z/\z\). 

2.  Q  =  -\qa2;  for  6  =  0,  4>  «  (g/47re0)(l/z  -  a2/4z3). 

3.  When  d  is  in  same  direction  as  E. 

4.  E  =  (3d-  rr  -  r2d)/4ire0r\  V  =  (r2d-d'  -  3d  ■  r  d1  ■  r)/4-Ke0r5; 

(a)  F  =  —F'  =  -6k(dd'/4ire0r4),  G  =  G'  =  0; 

(b)  F  =  —F'  =  3fc(cid747re0r4),  G  =  G'  =  0; 

(c)  F  =  —F'  =  3i(dd'/4ire()rA),  G  =  -j{dd'/4ire0r3),  G'  =  2G; 

(d)  F  =  F'  =  0,  G  =  —G'  =  -k(dd'/4neQr3). 

5.  V  =  ^ijijiUQj/^Treonj). 

6.  I(q2/4ne0a);  \(q2/4^a). 

7.  4.5  x  105C,  0.28  Jm"2. 

8.  8naoa4/5,  2na2a3 /25e0. 

9.  3ga2(x2  -  y2)/47re0?"5, 

£  =  (3ga2/47re0r7)  ((3a;2  -  7y2  -  2z2)x,  (7x2  -  3y2  +  2z2)y,  5(x2  -  y2)z) . 

10.  -Gm/r  +(Gma2/ri) (3  cos2  6  -  1); 

-Gm/a  +  (Gm/4a3)(2z2  -  x2  -  y2),  g  =  (Gm/2a3)(x,y,  -2z). 

11.  -6Gm/r  +  (7G,ma4/4r5)[3  -  5(o:4  +  y4  +  z4)/r4]. 

12.  -^(STrGpoa3^)^"1  -  a"1);  6.7  x  106yrs;  14.9 mins,  29.5 mins. 

13.  4.0  x  104(,J. 

14.  1/9.5. 

15.  3.0. 

16.  79  m;  78i?E- 

17.  (Me/Mm)2(J?m/-Re)4  =  35;  2.8km. 
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18.  12.5m. 

19.  (8TT/5)p0d0r4;  1.13. 

22.  qfi2e-vr/4nE0r;  -q. 

23.  2nGp2R2/3  =  1.7  x  1011  Pa=1.7Mbar. 

24.  $  =  -kp;  a  =  ^kjAG. 

25.  -1.43  x  10-6s_1;  51  days. 

26.  17.9  yrs  (should  be  18.6  yrs). 

CHAPTER  7 

1.  258  days. 

2.  7.4  x  10s km  from  centre  of  Sun,  i.e.,  just  outside  the  Sun;  0.28°. 

3.  0.00125 M0;  mx  >  0.00125 M0. 

4.  z\  =  I  +  mivt/M  -  \gt2  +  (m2v/Muj)  smut, 

22  =  mivt/M  —  \gt2  —  (miv/Moj)  sin  cut,  with  ui  =  \fkfp,  and  v  <  lui. 

5.  mxjmn  =  1. 

6.  12;  0.071. 

7.  62.7°,  55.0°,  640 keV. 

8.  T*  =  m2Q/M,  T2*  =  rmQ/M;  3.2MeV,  0.8MeV. 

9.  Inl06/ln2  «  20. 

10.  90°;  45°, 45°. 

11.  2.416;  (0.65w,0.15w,0),(0.35v,-0.15v,0). 

12.  T*/T  =  m2/M;  ->  1  or  0. 

13.  3  x  10_6,+450km,  +2.4min. 

15.  a2  cos  0(1/  sin4  0  +  1/  cos4  0),  where  a  =  e2 /2ireo7nv2 .  (The  second 
term  comes  from  recoiling  target  particles.) 

16.  1.8  x  103s-\  same  for  both. 

17.  2mR  =  0,  \mr  =  qE  -  (q2 /47re()r3)r;  z  =  2qE/muj2. 
CHAPTER  8 

1.  0.99  kms"1,  164kg. 

2.  (2.44+  1.48  =)3.91kms-\  143kg. 

3.  4.74kms"1. 

4.  3  stages,  1.48  x  105kg. 

5.  14.2kms-1,  2.06 kms"1,  2.8  x  106kg. 

6.  \M0u2{\  -  e""/"). 

7.  44.6  km,  33.9  km. 

8.  lO^kms-1,  (3.07  -  0.07  =)  S.Okms"1;  6.13t. 

9.  If  ui  =  {v,0):  (-l,0)u/5,  (3,±y/3)v/5,  (\v2\  =2y/3v/5). 
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10.  —pAv2,A  =  irr2;  because  scattering  is  isotropic. 

11.  5a  =  -2(//m)x/(l  +  e)a3/GM(l  -  e). 

12.  da/At  =  dZ/cU  =  -2pAva/m. 

13.  -20.9s,  -16.9km;  -3.37s,  -2.78km. 

14.  (a)  6.2 h,  1.85 d;  (b)  3.15d,  40.4d. 

CHAPTER  9 

1.  4v/2a/3,  {Z^g/Aa)1'2. 

2.  64r.p.m.,  5.3  x  10~6  J  from  work  done  by  insect;  dissipated  to  heat. 

3.  (a)  E,P;  (b)  J  about  leading  edge;  (c)  E;  3w/8a,5/8; 

[16(^/2  -l)5a/3]1/2. 

4.  (a)  1.011s,  (b)  1.031s. 

5.  4a/3;  3feX/4Ma2,  36X/4a;  6  =  4a/3. 

6.  (a)  3Mc/cos^;  (b)  (Mg/8)(-9 sin2<^,  11  +  9 cos 2cp), 
(3M#/2)(-  sin  2^, 1  +  cos  2^). 

7.  9  x  10-6kgm2,  16  x  10-6kgm2,  25  x  10-6kgm2; 
(1.08, 1.44,  0)  x  10"4  kgm2  s"1;  6.3  x  10"3  N. 

8.  (a)  (8,8,2)Ma2/3;  (b)  (11, 11,  2)Ma2/3. 

9.  2M(a5  -  65)/5(a3  -  63). 

10.  25.6s,  1.097  x  103  J. 

11.  60°. 

12.  h  =I2  =  3M(a2  +  4h2)/ '20, I3  =  3Ma2/10;  1/2;  Z  =  3h/4, 
I*  =  I*  =  51Afa2/320, If  =  I3. 

13.  1.55s. 

14.  112s. 

15.  0.244s"1. 

17.  8.83m. 

18.  (a)  2.64Hz  (f2  =  16.6s"1);  (b)  3.44Hz  (fi  =  21.6s"1). 

20.  2.50  x  lO-^s"1  =  16.3" yr"1. 

21.  22.3s. 

CHAPTER  10 

1.  ±g/±  

2.  y/4mgl/(M  +  2m)a2. 

3.  Mmg/(M  +  2m). 

4.  M2rng/k(M  +  2m)2. 

6.  g/7,3g/7,-5g/7. 

7.  24mg/7,  \2mgjl. 
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8.  62.62s"1,  4.347s"1  (cf.  4.065s"1);  371.5s"1  (3548r.p.m.). 

9.  (M  +  m sin2  6)16  +  ml62  cos 6 sin 6  +  (M  +  m)gsm6  =  0;  1.40 s"1 . 

10.  Ixfi  =  sin  \  cos  ip  —  Ii£l2  sin2 \sm  ip  cos  ip;  (Ii/I^Q  sin  A;  east 
and  west. 

11.  I±,  I3  are  replaced  by  7*  <  =  I3;  large  fl  is  bigger,  small  fl  is 
slightly  smaller. 

12.  arcsin(l/v/3)  =35.3°. 

14.  (a)  as  at  t  =  0  except  that  for  1/2  —  ct  <  x  <  1/2  +  ct,  y  =  a  —  2act/l; 
(b)  y  =  0;  (c)  y(x,l/c)  =  -y(x,0). 

15.  x  -  2ujy  -  uj2x  =  -GM^x  +  oi)/rf  -  GM2(x  -  a2)/rf , 
i/  +  2w±  -  a;2y  =  -GMxy/r\  -  GM2y/r%,  with 

r2  =  (x  +  ai)2  +  y2,  r\  =  (x  -  a2)2  +  y2,uj2  =  GM/a3. 

CHAPTER  11 

1.  x  =  a(cos  uiit  cos  u>2t  +  5-^/2 sinwifsinu^i), 
y  =  a{2  cosuit  cosui2t  +  §y2sina;ii  sina^t), 
where  oji,2  =  |(a;+  ±        and  u±  =  i/(2  ±  y/2)g/l. 

2.  uj2  =  g/l,g/l  +  k/M  +  k/m;  Ax/Ax  =  1,-m/M;  2Ma/(M  +  m);  no. 

3.  xq  =  2mg/k,  yo  =  3mg/k;  uj2  =  (3  ±  y/5)k/2m. 

4.  uj2  =  u>q,uJq  +  u2,ul  +  3d2; 

Ax  :  Ay  :  Az  =  1  :  1  :  1,  1  :  0  :  -1,  1  :  -2  :  1. 

5.  2a/3,a. 

6.  (a)  uj2  =  k/m,3k/m;  (b)  uj2  =  (1  -  a/l)k/m,  3{1  -  a/l)k/m. 

7.  oj2  =  (M  +  m)g/ma,g/2a. 

8.  6  =  (ifio/10) (cos 27rt  —  cos37rf)  (t  in  s);  ipo/5,t  =  Is. 

9.  1.0025  s,  0.09975  s;  0.5025  mm. 

10.  (0. 401  sin  6. 27t  -  0.0399  sin  63.  Of)  mm,  (tins). 

11.  0  =  (g/47re0)[14/a  +  (4a;2  +  y2)/2a3];  w2  =  g2/47re0a3m,  u\  =  4a;2. 

12.  a;2  =g/l,g/l,3g/l. 

13.  Ai>2  =  (F / \/2m){u)2  2  -  ui2  +  2i7i,2w),  with  a;2  =  a;2,  w|  =  a;2  +  2oj2, 
71  =  a/2m,  72  =  (a  +  2/3)/2m;  a  >  y/3k/ojo- 

14.  g2r  =  0,  g2r+i  =  (- l)r4v^Ia/ir2 (2r  +  l)2. 

CHAPTER  12 

1.  a;2  =  gcosa/rsin2  a;  arcsin(l/v/3)  =35.3°. 

2.  n2  =  lo2  -  g2/lW. 

3.  H  =  (jpx  -  py)2/Qm  +  p2y/2m  +  \ky2  -  mgy; 

y  =  mg(l  —  cosu>t)/k,  x  =  xq  —  y/4,  ui2  =  4fc/3m. 
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4.  J2  =  p2e  +  p2/ sin2 


5.  H 

6.  H 


+ 


2ml'- 
P2x+Py 


(Ipx  -  pe  cosfl)2 
2(M  +  msm2  <9)Z- 


+  mgl(l  —  cos9). 


2M        2  (J*  +  Mi?2  sin2  0) 


+ 


(gg  -Py-  cosg)2 
2I{  sin2  61 


4I*MgR/I%,  reduced  by  a  factor 


+  ^+MgR  cos  6;  ^>min 

7.  (pv  -  p^zf  -  2h(l  -  z2)(E  -  Mgrz  -  p\ /2J3)  =  0. 

8.  #  =  (p  -  qA)2/2m  +  q<p. 

9.  H  =        +  -  +  12B2P\ 

2m     2mp2      Aireop  2m 

10.  ^/q'm/ire0qB2; 

11.  0  <  b  <  a,  a2/ib. 

12.  Mj/M2  >  1(25  +  7621) 

13.  n2  -  -~2 

147.4  yrs. 

A    .       J     dU  J2 

14.  tp 


24.96. 


39'  =  |w2{l  ±  yjl  -  27M1M2/(M1  +  M2)2};  11.90yrs, 


mp2  'dp  mp3 


qJBz 
mp 


dU 

7h 


qJBp 
mp 


CHAPTER  13 


1.  (a)  /  =  k2/4a;  equilibria  x±  =  (k  ±  \Jk2  —  4fa)/2a,  x+  is 
asymptotically  stable,  (b)  x  <  f  —  f  <  0  =>  to  <  k/a(f  —  /).  In 
practice,  /  is  not  precisely  known  or  constant. 

2.  Trajectories  |#2  —  (g/l)cos8  =  constant  (=  g/l  on  separatrices). 

4.  x  =  y,  y  =  —  uJqX  —  py.  Critical  point  (0,  0)  which  is  (i)  asymptotically 
stable  spiral,  (ii)  (improper)  asymptotically  stable  node,  (iii)  (inflected) 
asymptotically  stable  node. 

5.  (0,0)  and  (1,0)  are  asymptotically  stable  nodes;  (|,0)  is  a  saddle. 
[Note  that  the  nodes  are  local  minima  of  U(x,  y).] 

6.  f(x)  =  —  c\nx  +  dx  has  a  single  minimum  and  /  — >  +oo  when  x  — >  0+ 
and  when  x  — >  +oo. 

7.  (0,0)  has  A12  =  1,  \  with  eigenvectors  (1,0),  (0, 1):  unstable  node; 
(0,2)  has  Ai_2  =  —1,  —  \  with  (1,3),  (0, 1):  asymptotically  stable  node; 
(1,0)  has  Ai_2  =  —  1,—  \  with  (1,  0),  (1,  — |):  asymptotically  stable 
node;  (|,  |)  has  Ai,2  =  (—5  ±  \/57)/16  with  (—  |,  \  +  A,):  saddle. 
Nearly  all  initial  conditions  lead  to  extinction  for  one  or  other  species 
in  Fig.  13.9  ==>  no  stable  coexistence.  Second  set  of  parameters 
=>  (0,0):  unstable  node;  (0,  |)  and  (|,0):  saddles;  (\,l): 
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asymptotically  stable  node;  in  this  case  there  is  asymptotically  stable 
coexistence. 

8.  M=   ~Cl  a2     =>  A2  +  (ci  +  C2)A  +  (C1C2  -  0102)  =  0.  Then 

_  Ol  -C2_ 

C1C2  >  a\a2  =>  Ai,A2  real,  negative:  asymptotically  stable  node, 
coexistence  in  first  quadrant;  and  C\Ci  <  a\a2  ==>  \\,  A2  real, 
opposite  sign:  saddle  in  third  quadrant  and  trajectories  run  away 
— >  +00. 

9.  Critical  points  at  (0,0),  ,  a(o+jl) ]  •  ^  <  1;  asymptotically 
stable  node,  saddle.  R  >  1:  saddle,  asymptotically  stable  node.  Disease 
maintains  itself  only  when  i?  >  1 . 

10.  At  (0,0),  Ai  2  =  (e±  Ve2  -  4)/2;  unstable  spiral  (e  <  2)  or  node 
(e>2). 

11.  At  (0, 0),  A12  =  a/3  ±  i,  so  we  have  an  asymptotically  stable  spiral 
(f3  <  0),  unstable  spiral  {(3  >  0)  or  centre  {(3  =  0).  In  polars,  #  =  1 
and  r  =  ar(/3  —  r2);  for  /3  >  0,  r  — >  v7?  as  i  and  #  — >  00;  when  /?  <  0, 
r  ->  0. 

12.  (a)  J\J\  +  J2J2  +  J3J3  =  0;  then  integrate  to  get  \  J\  =  constant,  (b) 
(±  J,  0,  0),  (0,  0,  ±  J)  are  centres  on  the  sphere;  (0,  ±  J,  0)  are  saddles. 

(c)  j3  =  0  =>  J3  =  /3fi-  Ji,2  satisfy  J,  +  [(A  -  h)/h]2n2  Jt  =  0. 

(d)  Critical  point  (0,0,0),  with  Ai^^  =  —  \fi\/Ii,2,3  and  eigenvectors 
(1,  0,  0),  (0, 1,  0),  (0,  0, 1).  As  uj  ->  0,  oj  ->  rotation  about  axis  with  the 
largest  moment  of  inertia,  13. 

13.  (a)  dA/dt  =  0  and  dl/dS  =  i/S  =  -l  +  b/aS.  (b)  dl/dS  =  0  when 
5  =  b/a  and  d2//dS'2  =  —b/aS2  <  0,  so  maximum.  I  =  0  at 

5  =  S1-,  S+.  (c)  5*0  =  b/a +  5,  To  =  e.  Write  S  =  b/a  +  £  and  expand 
=>  trajectory  passes  through  (fo/a  —  5,  e).  Since  So  can  be  made 
arbitrarily  close  to  S-,  so  S+  is  arbitrarily  close  to  b/a  —  5  and  these 
susceptibles  escape  infection. 


/ 
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14.  For  a  critical  point  (X,  Y,  Z),  the  matrix  M 

-i(a  +  i)±!vT^TTF 


—a 
p-Z 
Y 


a       0  ' 

-1  -X 

X  -(3 


p)a,  so  that 


(a)  Ai  =  -/?  <  0,  A2,3  =  -±(<r  +  1)  ±  ±V(CT  +  !)2  "  4(1 
A2,3  are  real  and  are  both  negative  only  when  0  <  p  <  1. 
(c)  p  =  1  =>■  cubic  becomes  A(A  +  f3)(X  +  a  +  1)  =  0.  If  the  cubic 


has  the  form  (A  +  p)(X' 
pv2  =  2a0(p  -1)  => 


+  v2)  =  0  then  p  =  a  +  P  +  l,  v2  =  (3(a  +  p), 
result  for  pcrit.  (d)  Write  p  =  1  +  e  (with  a,  p 


iv 

(o+}  + 1)    -(cr  +1)     -p  ^ 

0 

—  iv 

constant,  e  small)  and  find  changes  in  eigenvalues  by  putting 
A  =  Ai  +  £(e)  and  performing  a  linear  analysis  of  the  cubic  in  (b). 
(e)  Evidently  RHS  =  0  is  an  ellipsoid  and  the  distance  of  (x,  y,  z) 
from  O  =  (0, 0,  p  +  a)  decreases  where  RHS  <  0,  i.e.,  outside  the 
ellipsoid  and  so  a  fortiori  outside  any  sphere  centred  at  O  which 
contains  it. 

15.  (b)  For  a  critical  point  {X\,  X^,Y\  the  matrix 

"  -p      Y  Xi 
M=    Y-A    -p  X1 

_  -X2    -X1    0  _ 
(c)  V  ■  (x)  =  -2p  <  0.  (d)  X\  +  Xl  +  Y2  =  C,  where  C  >  0,  is  an 
ellipsoid  (oblate).  Trajectories  move  towards  smaller  C  values 
whenever  (Xi,X2,Y)  is  below  the  paraboloid  Y  =  p(X2  +X%)/y/2. 

16.  Ax  =  Axq  cosut  +  (Ayo/ui)  smut,  Ay  =  —u>  Axq  s'muit  +  Ayo  cosut. 
Each  is  bounded  and  so  is  \J (Ax)2  +  (Ay)2. 

17.  Trajectories  in  the  rcy-plane  are  parabolic  arcs  in  x  >  0.  The  higher 
the  energy  the  longer  the  period  of  oscillation  (between  successive 
bounces).  Between  bounces,  Ax  =  Axq  +  Ay^t,  Ay  =  Ay(),  so  that 

+  (Ay)2  ~  Kt  for  t  increasing,  and  bounces  don't  affect  this 


J(AxJ2 
result. 


18.  Evidently,  we  have  (1),  (|,  \),  (\ 


v4'  2'  ii 


etc. 


(\)n  x  (row  of  Pascal's  triangle).  When  n 


m  successive  rows,  i.e., 
=  16,  we  have 
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(±)16[1, 16,120,560,1820,4368,8008, 11440,  12870,11440,8008,4368, 
1820,560,120,16,1]. 

CHAPTER  14 

1.  Return  trajectories  have  — <  E  <  0. 

to  =  ^2h3m/\k\  [tt/2  -  &rcsm(^R/h)  +  sJ(R/h){\  -  R/h)]  -> 

y/h3m/2\k\  7r  as  R/h  — >  0,  i.e.,  Kepler's  third  law  for  a  flat  ellipse, 
major  axis  h  in  this  limit. 

2.  Explicitly,  [F,H]  =  0  in  each  case.  In  (c),  the  vector  A  may  be 
considered  using  Cartesians  or  polars.  (Then  the  orientation  of  the 
orbit  may  be  specified  using  a  =  &Tct&n(Ay  /  Ax)  and 

\A\2  =  1  -  (2J2\E\/mk2),  consistent  with  (4.30).) 

3.  H  (=  E)  and  pg  are  constants  of  the  motion.  Steady  r  =  tq  when 
pg  =  mrfao  with  Qq  =  ±y/g/ro.  Small  oscillations  r  =  ro  +  A  => 
SHM  for  A  with  frequency  •y/3g/2ro. 

4.  u>  =  J g  cos  a/ro  sin2  a,  rotation  rate  at  ro-  Small  oscillations 

r  =  r-Q  +  A  =>  SHM  for  A  with  frequency  zu  =  y3(wsina).  Closure 
for  rational  ^/3sina. 

5.  Effective  potential  p2e /2mr2  —  | k \  jr  ==>  minimum  —k2m/2pg  when 
r  =  ro  =  Pg/m|fc|,  circle;  (r,pr)  curves  closed  (around  r  =  ro)  when 
-E  <  0,  corresponding  to  (r,  6)  ellipses;  E  >  0  curves  stretch  to 

r  — >  oo,  (r,  0)  hyperbolae,  parabola.  We  always  have  closure  for  this 
system  —  given  E,  Poincare  section  is  a  single  point  on  an  (r,pr) 
curve,  a  different  point  for  each  choice  of  6  section. 

6.  H  =  E  here.  Action  /  =  2E-s/l/g  =>■  E  =  \\fgjll,  and  frequency 
u  =  dH/dI=\y/gTl.   

7.  Oscillation  between  x  =  ±[Z  +  y/2E/k].  Action 

I  =  (2l/n)V2~^E  +  E/n  =^>  VE  =  yTft  +  (32  -  (3. 

<f>(x)  =  (d/dl)  Jx  pdx  with  <f>  =  u>(I)t  +  constant, 

oj  =  Qy/E/(y/E  +  f3),T  =  2-k/lj.  Small  E  ==>  T  ->  oo  as  £-1/2; 

large  £  =>  T  ->  27r/fi. 

8.  J  =  Iv^mfc^/fc  -  \A/fc]  =>  E(I);4>(q)  =  (d/dI)fqPdq  =  uit  +  P, 
with  w  =  dE/dl  =  2y/k/m.  Evaluating  <f)(q)  gives 

1  /2 

g(t)  =  [f  +  VEl~Xk  sm(ujt  +  j3)]      .  (Note.  For  the  isotropic 
oscillator  there  are  two  radial  oscillations  for  each  complete  elliptical 
trajectory.) 
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9.  Action  Ix  =  (\/-2mE/n)       i/(r2  ~~  r)(r  ~~  r1)(dr/r),  where 

rx  +  r2  =  -\k\/E  and  T'ir2  =  -I^/ZmE  =>■  result.  So  E(Ii,h)  and 
=  =  w2  (see  (14.18)). 

10.  Consider  the  lines  from  a  bounce  point  to  the  foci  and  the  angles  they 
make  with  the  trajectory  just  before  and  just  after  the  bounce. 

A  =  [(a:  +  ae)py  —  pxy]  [(x  —  ae)py  —  pxy\.  Change  to  A,  8  variables;  H 
from  Chapter  3,  Problem  24  using  p\  =  mc2(cosh  A  —  cos2  6)\, 
pg  =  mc2 (cosh2  A  —  cos2  6)6.  Turning  value  for  A  is  when  p\  =  0  ==>■ 
tangency  condition. 

11.  (a)  Motion  within  curve  (E/mg/ir)  =  1  —  (l//i)cos#  (ellipse). 

(b)  /J,  =  1:  bounding  curve  E  =  2mgrsm2(6/2);  fj,  <  1:  curve  as  in  (a) 
—  hyperbola  for  E  ^  0,  two  straight  lines  for  E  =  0. 

12.  Hamilton's  equations:  x  =  px  +  coy,  px  =  uipy  —  u)2x  +  dU/dx, 

y  =  py  -  lux,  py  =  -ujpx  -  uj2y  +  dU/dy  and  H  =  \(px  +  ujy)2+ 
\(j)y  —  ujx)2  —  U .  System  autonomous,  H  =  constant;  U  >  C  for 
possible  motion  and  for  the  Earth/Moon  system  the  critical  C  is  that 
corresponding  to  the  'equilibrium'  point  between  Earth  and  Moon. 

13.  Action  I  =  E/u),  so  E  oc  u>  oc  i-1/2.  Maximum  sideways  displacement 
=  Wmax  oc  I1/4.  Maximum  acceleration  |o;2/6'max|  oc  Z-3/4. 

14.  Action  I  (x  L\f~H  and  H  oc  v2  =>  v  oc  1/L.  Temperature  oc  1/L2 
=>  pressure  oc  1 /L5  pressure  oc  (density)5^3. 

15.  Action  I  =  (2m/3Tr)y/2gqQ  sin  a  and  E  =  mgqosma  =^  result. 
Frequency  oj  =  2n^g  sin  a/8qo-  Evidently,  I  =  constant  ==> 

E  oc  (sina)2/3,  qo  oc  (sina)-1/3,  period  2-k/oj  oc  (sina)-2/3.  So 
E2/E1  =  0.69,  902/901  =  1-20,  ujx/uj2  =  1.44.  

16.  Evidently  E  oc  k2  and  r  oc  k~2.  Since  r  oc  1/a3 /|fc  (see  (4.31)), 
aocl/|fc   ==>  fc  decreases,  a  increases.  Since  eccentricity 

e  =  a/I  +  2EI2/mk2  (see  (4.30)),  e  remains  constant. 

APPENDIX  A 

1.  (a)  1,2,3;  (b)  (-3, -3,  3),  (-3,  7,  8),  (-6, 4, 11);  (c)  -15,15; 
(d)  (-3,  3,0),  (-1,3, -3);  (e)  as  (d). 

2.  164.2°,  16.2°. 

4.  (3x2  —  yz, —xz, —xy);  6x. 

5.  (a)  (1  -  y2/x2,  2y/x,  0),  (2x/y,  1  -  x2/y2,  0);  (b)  circles  passing 
through  the  origin  with  centres  on  x-  or  y-axes,  respectively, 
intersecting  at  right  angles. 
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8.  a(V  ■  b)  +  (b  ■  V)a  -  b(V  •  a)  -  (a  •  V)b. 

9.  2u)k. 

11.  (V2A),  =  V2(^)-1(ap  +  2^), 


(V2A)„  -  „  />2 


V'(AV)  -  -z  (A 


.dAf 

dip 


12.  Ar 


0,  Au 


Ho  \x  sin  9/Airr 


2. 


(V2A) 


V2(AZ 


fj,oHCOs9/2irr3 ,  Bg  =  /io/isin6*/47rr3,       =  0; 
A  =  (^0/47rr3)/x  Ar;  B  =  (/i0/47rr5)(3/x  •  rr  -  r2/x). 

13.  B  =  (fi0I/4Trr3)ds  A  r;  F  =  (fi0H' /4irr3)ds'  A  (ds  A  r); 
F  +  F'  =  (Mo//'/47rr3)r  A  (ds  A  ds')  ^  0. 

14.  Ar  =  r_1cos#,       =  r-Mnrsin^,  Av  =  0;  r"2(l  +  21nr)  cos#; 
(0,0,2r-2sin6>). 

cos  6     0    sin  t 

15.  0  10 
—  sin  6   0    cos  ( 

17.  Q  =  JX/>(r')(3rV  -  r,2l)dV. 


APPENDIX  B 

1.  Distances  n,2  are  given  by 

r2  2  =  (acosip  =p  ae)2  +  (&sini/>)2  =  a2(l  =p  ecos'</>)2. 

2.  Tangent  vector  is  t  =  (— asin-0,  b  cos  ip),  unit  vectors  from  two  foci  are 
ni.2  =  (1  =F  ecosi/>)_1(cos</>  =F  e,  (Va)  sin^),  scalar  products  are 

t  ■  «i,2  =  ±ae  sin  ijj. 


APPENDIX  C 

1.  (a)  (0,0),  Ai_2  =  — |  ±  5VT7,  eigenvectors  (1,3  +  Aj),  asymptotically 
stable  node. 

(b)  (0,  0),  Ai_2  =  4, 1,  eigenvectors  (1,  Ai  —  3),  unstable  node. 

(c)  (—1,  —1),  Ai_2  =  — 5  ±        eigenvectors  (—2,  A^  +  8),  asymptotically 
stable  node; 

(4,  4),  Ai_2  =  5  ±  iv/55,  unstable  spiral. 

(d)  (0,2),  A12  =  — l±i,  asymptotically  stable  spiral; 

(1,  0)  Ai_2  =  1,  —2,  eigenvectors  (—1,  Ai  +  2),  saddle  (:.  unstable). 

(e)  (0,2),  Ai,2  =  ±i2i/6,  centre  (:.  stable): 
(0,  —2),  A1.2  =  ±12-^/6,  centre  (.'.  stable); 
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(1,0),  Ai_2  =  —8,3,  eigenvectors  (1,0),  (0,1),  resp.,  saddle 
(.-.  unstable); 

(—1,  0),  Ai,2  =  8,  —3,  eigenvectors  (1,  0),  (0, 1),  resp.,  saddle 
(.-.  unstable). 

(f)  (0,ri7r),  n  odd:  Ai,2  =  ±i,  centres  (.'.  stable); 

n  even:  Ai^  =  ±1,  eigenvectors  (1, 1),  (1,  —1),  resp.,  saddles 
(.-.  unstable). 

(g)  (0,0),  Ai_2  =  ±V^2  —  ct,  so  u)2  >  a,  saddle  (.'.  unstable)  and 
cu2  <  a,  centre  (.'.  stable). 

(Note:  In  each  case,  consider  local  sketches  near  the  critical  points  and 
how  these  build  into  the  global  phase  portrait.) 
2.  x  =  y,  y  =  x3  —  x  =>■  (0,0),  centre,  (±1,0),  saddles; 

(a)  dy/dx  =  (x3  -  x)/y  =>  \y2  =  \x4  -  \x2  +  c,  i.e., 

y  =  ±^Jt;X4  —  x2  +  2c;  symmetry  about  x  axis  rules  out  spirals; 

(b)  separatrices  given  by  c  =  \. 

APPENDIX  D 

1.  (b)  -^F{2\x)  =  F'(x)F'(F(x))  =>  asymptotically  stable  when 
\F'(X1)F'{X2)\  <  1  =^>  |  -r2  +  2r  +  4|  <  1  =>  3  <  r  <  1  +  y/Q  (if 
r  >  0). 

(c)  a  =  1  -  l/r,b  =  2/r-  l,s  =  2  -  r. 

(d)  Applying  (c)  to  (a),  (b),  etc,  yields  asymptotically  stable 
points/cycles: 

s=2-r  Y  r  X 

0^1  0  2^1  1  -  1/r 

1^3  1-1/s  1^-1  0 

3^1  +  ^/6  (Fi(s),F2(s))  -1^1-^/6  (X1(r),X2(r)) 

3.57  2°°  accum.  -1.57  2°°  accum. 

4  range  limit  —2  range  limit 


2.  r  =  1  ±  VI  +  4o,  /3  =  -2a  =  r/a. 

3.  (a)  X  =  0  asymptotically  stable  in  circle  |r|  <  1,  X  =  1  —  1/r 
asymptotically  stable  in  circle  |2  —  r\  <  1. 

(b)  asymptotically  stable  2-cycle  when  —  r2  +  2r  +  4|  <  1,  or 
r  —  1  —  \/5\.\r  —  1  +  i/5|  <  1  (two  disjoint  ovals). 
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Note:  If  we  continue  in  this  way,  adding  the  regions  of  the  complex  r 
plane  in  which  there  are  cycles  of  any  finite  length  we  arrive  at  the 
figure  shown: 


Complex  r  plane 


(c)  In  the  complex  a  plane,  the  circles  become  a  heart-shaped  region 
with  a  cusp  at  a  =  —  j,  the  2-cycles  yield  a  circle  \a  —  1|  =  \.  The 
entire  set  yields  the  Mandelbrot  'signature'  (see  Peitgen  and  Richter, 
The  Beauty  of  Fractals,  Springer,  1986): 


Complex  a  plane 

4.  Fixed  points  at  X  =  0,  X  =  §  have  \F'(X)\  =  2  =>  instability. 
Uncertainty  en  =  ea2n  >  1  when  n  >  ln(l/e0)/ln2.  Note:  en  =  e02n 
=>  Lyapunov  exponent  A  =  In  2  >  0. 

5.  X  =  0  asymptotically  stable  if  \a\  <  1.  X  =  ±V 'a  —  1  asymptotically 
stable  when  |3  —  2a\  <  1,  or  1  <  a  <  2.  At  a  =  2,  there  is 
period-doubling  on  each  branch,  followed  by  a  period-doubling  cascade. 
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7.  (a)  Both  |Ai|,  |A2|  <  1  =^>  |6|  <  1  and  also  \2aX\  <  1  -  b.  (X_,  Y_)  is 
always  unstable;  (X+,Y+)  is  asymptotically  stable  if  a  <  |(1  —  b)2. 
(b)  Nontrivial  2-cycle  =>  X,  roots  of 

a2X2  -  o(l  -  b)X  +  (1  -  b)2  -  a  =  0,  with  Y  =  6(1  -  aX2)/(l  -  6);  real 
roots  if  a  >  |(1  -  6)2.  Eigenvalues  A  of  MiM2  satisfy 
A2  -  (4a2X1X2  +  2b)X  +  b2  =  0  =>  for  asymptotic  stability  \b\  <  1 
and  |4(1  -  b)2  -  4a  +  2b\  <  1  +  b2  =>  a  <  (1  -  6)2  +  ±(1  +  b)2  [using 
an  argument  similar  to  that  in  the  Hint] . 

8.  xn  =  X  +  £,  yn  =  Y  +  r,  =>  a:„+i  =  1  -  aX2  +  Y  +  f ,  yn+1  =  bX  +  f], 
with  ^  =  — 2aX£  +  r],  fj  =       Circle  £2  +  J]2  =  e2  becomes 

[£  +  (2aX/&)^]2  +  fj2/b2  =  e2,  ellipse  with  semi-axes  e/y/JIi,  e/^/Jt^ 
where  \i\ii2  =  1/b2,  fii  +  fi2  =  1  +  (1  +  4a2X2)/&2.  Area  of  ellipse 
=  Tr\b\e2  =>■  area  reduction.  (1  —  /Ui)(l  —  ^2)  <  0  ==>■ 
0  <  /xi  <  1  <  ^2  (say)  =>  Lyapunov  exponents  Ai  >  0  >  A2. 

9.  (a)  Fixed  points  Pi  =  (1  +  a  -  6)_1(1,  &)  if  a  >  6  -  1, 

P2  =  (1  _  a_  6)-i(i,6)  if  a  >  1  -  6.  If  |6|  <  1  and  |a|  <  1  -  6  only  Pi 

exists,  X  —  b/X  =  —a  =>■  Pi  asymptotically  stable. 

(b)  For  nontrivial  2-cycle  Qi  =  (Xi,  Yj)  +±  Q2  =  (X2,Y2),  X1:X2  must 

be  of  opposite  sign.  Hence  X1.2  =  (1  —  b  ^  a)/[a2  +  (1  —  6)2], 

Y,2  =  bX2tl. 


0  1  2      x  0  1  2  x 


(b)  eigenvalues  /ii  =  2,  /l*2  =  5  =>  Ai,2  =  ±ln2  =  ±0.6931.  See 
diagram: 
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12.  (a)  a,/3  distinct  irrationals  with  (5  —  a  >  e  >  0,  then  choose  integer  N 
such  that  e  >  1/N;  mesh  integer  multiples  of  1/N,  at  least  one  is 
between  a  and  f3.  (b)  Since  y/2  is  irrational,  then  e.g., 

(1  —  l/y/2)(pi/qi)  +  (1/ V2)(P2/<?2)  is  irrational  and  lies  between  the 
rationals  pi /gi  andp2/?2- 

13.  We  may  take  both  0  and  I  to  be  2-7r-periodic  here.  See  diagram: 

k  =  0.8 


0  2  £        4  6 


[from  Reinhardt  and  Dana,  Proc.  Roy.  Soc.  413,  157-170,  1987] 
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instantaneous,  217,  224 
axis  of  symmetry,  207 

baker's  transformation,  443 
basin  of  attraction,  334,  426 
beam,  colliding,  169 
Bernoulli,  70 
Bertrand's  theorem,  351 
bicycle  wheel,  124 
bifurcation,  309,  428 
billiard  balls,  41 
billiard  systems,  363-366,  378 

circular,  364 

elliptical,  365,  378 

oval,  366,  441 
body  cone,  224 
Bohr  orbit,  84,  121,  126 
boost,  295,  298-300 
bounce  map,  364,  366 


bouncing  ball,  345 
boundary  conditions,  61 
brachistochrone,  70 
broken  symmetry,  363 
Brusselator,  328 
butterfly  effect,  339 

calculus  of  variations,  59-62,  233 

canonically  conjugate  pairs,  301 

Cartesian  co-ordinates,  4,  5,  382 

catastrophe,  309 

catenary,  72 

cause  and  effect,  339 

central  conservative  force,  8,  66, 

73-98,  279,  283-284,  350,  357-359, 

375 

Hamiltonian  for,  279 

impulsive,  90 

of  varying  strength,  371 

potential  energy  of,  66 
central  force,  55-57 

and  angular  momentum 
conservation,  291 

internal,  181,  189,  198 
centre,  315,  420,  422 

true,  422,  424 
centre  of  charge,  137 
centre  of  mass,  6,  138,  159,  177 

equation  of  motion  of,  159,  178, 
218 

centre-of-mass  frame,  162-164 
centrifugal  force,  77,  112,  235,  249 

potential  energy  of,  114,  141,  239 
centripetal  acceleration,  111 
Chandler  wobble,  224,  330 
chaos,  294,  334,  337,  338,  347 

routes  to,  375 
chaotic  attractor,  334 
characteristic  equation,  259,  314 
charge,  8 

centre  of,  137 

total,  137 
charge  density,  134,  150,  397 
charge-to-mass  ratio,  130 
charged  particle  in  electromagnetic 

field,  241-244,  302 
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chemical  oscillation,  327 
circular  billiard,  364 
circular  orbit,  78,  81,  410 

velocity  in,  83 
clockwork  universe,  339 
closest  approach  distance,  79-80 
co-ordinates 

Cartesian,  4,  5,  382 

curvilinear,  58,  398-400 

elliptic,  72,  378 

ignorable,  282-285 

normal,  264-266 

orthogonal,  58,  253-256,  398 

parabolic,  71 

polar,  57-59,  398-400 
coefficient  of  restitution,  42 
coins,  366 
collision,  26,  37 

elastic,  41,  165-173 

hard-sphere,  90-94,  169,  172 

inelastic,  42 

multiple,  95 

two-body,  11,  14,  39-42,  165-173 
combat  model,  322-323 
comet,  89,  99 

competing  species,  318-322  ,  342 
competitive  exclusion  principle,  321 
complex  amplitude,  25,  258 

of  forced  oscillation,  31,  265 
components,  of  vector,  382 
composite  body,  14,  178 

force  on,  178 

mass  of,  12,  178 

position  of,  5,  14,  178 
compound  pendulum,  200-202 
cone,  207 

moments  of  inertia,  227 

particle  on,  301 
conies,  86,  409-413 

Cartesian  form,  409-412 

polar  form,  412-413 
conservation  law 

in  constrained  system,  239 

of  angular  momentum,  55,  65,  66, 
74,  76,  182,  280,  283, 
291-293,  296-297 


of  energy,  9,  18,  25-26,  40,  50,  74, 
76,  188-190,  198,  280-282, 
295 

of  momentum,  11,  39,  160,  166, 
178,  296 
conservation  laws,  2,  76-78 

and  symmetries,  291-301,  347 

for  symmetric  top,  284 

in  polar  co-ordinates,  76 
conservative  force,  17-18,  49-51, 

188-190 

condition  for,  50,  395 
impulsive,  40 
internal,  189 

potential  energy  for,  18,  394 
conservative  system,  315-317 
constraint  equation,  231-232,  239 

and  Lagrange  multiplier,  240 
continuous  dynamics,  307,  312 
control  parameters,  308 
convection,  331 
Coriolis  acceleration,  111 
Coriolis  force,  112,  114-120,  125,  229, 

249 

Coulomb  gauge,  397 

Coulomb  scattering,  79-80 

Coulomb's  law,  8 

coupled  oscillators,  261-266 

critical  damping,  29,  46 

critical  line,  415 

critical  point,  311,  313,  415-424 

isolated,  415 
cross-section,  90-97,  168-173 

differential,  93,  97,  169,  171,  172 

total,  94,  170 
cube,  moments  of  inertia,  210 
cubic  map,  44% 
curl,  391 

in  polar  co-ordinates,  400 

of  curl,  393 

of  gradient,  393 

vanishing,  51,  394 
current  density,  397 
current  loop,  124 
curvilinear  co-ordinates,  58,  63, 

398-400 
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unit  vectors  for,  71 
cycloid,  70,  376 
cyclone,  119 
cyclotron,  109 

cyclotron  frequency,  109,  304 
cylinder,  207 

moments  of  inertia,  210 
cylindrical  polars,  57-59,  65,  71, 

398-400 

kinetic  energy  in,  58 
Lagrange's  equations  in,  65 

damped  oscillator,  27-29,  274,  341 
day,  105,  186 
degeneracy,  350,  373 
degrees  of  freedom,  232,  294 
Delaunay,  373 
denominators,  small,  373 
depression,  119 
determinant,  258,  416 
determinism,  339 
diagonalization,  403-404 
dice,  366 

differential  cross-section,  93 

for  Rutherford  scattering,  97 

hard-sphere,  93,  172 

in  CM  frame,  171 

in  Lab  frame,  169,  171 
differentiation  of  vector,  388 
dipole 

electric,  131 

magnetic,  124,  406 
dipole  moment,  132,  137 
discrete  dynamics,  307,  312 
discrete  map,  353 
discrete  systems,  425-441 
disease,  343 
divergence,  391 

in  polar  co-ordinates,  399 

of  curl,  393 
divergence  of  trajectories,  374 
dot  product,  384 

of  tensors,  401 
double  dot  product,  406 
double  pendulum,  255-257,  260,  369 
double  star,  14,  174 


dyadic,  401 

dynamo,  334-336  ,  345 

Earth,  xv,  140-144,  163,  214 
angular  velocity  of,  xv,  105, 

185-186 
core  of,  154,  336 
escape  velocity,  82 
magnetic  field  of,  334 
motion  near  rotating,  112-113 
orbit  of,  xv,  88,  98,  100 
precession  of  axis  of,  214,  224 
shape  of,  113,  140-144,  214 
Earth-Moon  distance,  xvi,  154,  163 
Earth-Moon  system,  163,  183-188, 
215 

eccentric  anomaly,  101 
eccentricity,  86 

of  Earth's  orbit,  xv,  88 
ecliptic,  215 

effective  potential  energy,  77,  235 
for  central  force,  283 
for  constrained  system,  239 
for  inverse  square  law,  78,  80 
for  symmetric  top,  285,  288,  302 

eigenvalue  equation,  258,  314,  403, 
416 

eigenvalues,  of  Jacobian  matrix,  314 

eigenvector,  258,  403 

Einstein,  1,  3,  10,  192 

elastic  bouncer,  357,  371,  379 

elastic  collision,  41,  165-173 

electric  charge,  8 

electric  dipole,  131 

electric  field,  130,  150,  241,  397 

electric  quadrupole,  132 

electromagnetic  field,  9,  397-398 

particle  in,  110,  241-244,  282,  302 
electromagnetic  force,  9,  110,  235,  241 
electron,  xv,  84,  167 
electrostatic  force,  8,  14,  130 
electrostatic  potential,  129-131 
elements,  3 

ellipse,  75,  81,  87,  410,  412 
ellipsoid,  moments  of  inertia  of,  210 
elliptic  co-ordinates,  72,  378 
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elliptic  orbit,  75,  86-87 

elliptic  point,  420,  440 

elliptical  billiard,  365,  378 

energy,  25-26 

and  time  translation  symmetry,  295 
conservation  of,  9,  18,  25-26,  40, 
50,  74,  76,  188-190,  198, 
280-282,  291 
kinetic,  17,  49,  161,  162,  188 
lost  in  inelastic  collision,  42 
potential,  18,  50,  161,  190,  394 
total,  18,  50,  280 

transferred  in  elastic  collision,  166 
energy  levels  of  hydrogen  atom,  84 
entropy,  433 
equal-area  map,  437 
equations  of  motion 

for  relative  motion,  160 

in  electromagnetic  field,  242,  250 

of  centre  of  mass,  159,  178,  218 

of  rigid  body,  199,  218 

of  small  oscillations,  256-257 
equilibrium,  253,  311 

condition  for,  256 

hydrostatic,  155 

motion  near,  20-25,  253-271 

of  rigid  body,  198 

solution,  313 

stability  of,  22,  259 
equipotential  surface,  132,  141,  142, 

147 

equivalence  principle,  10,  130 
equivalent  simple  pendulum,  201, 

202,  226 
ergodic  behaviour,  291 
ergodic  trajectory,  350 
escape  velocity,  82-83,  99 

from  Earth,  xv,  82 
Euclid's  axioms,  1 
Euclidean  geometry,  2 
Euler  angles,  221-223,  231 
Euler-Lagrange  equations,  61,  62 
external  forces,  13,  177 
external  potential  energy,  189,  191, 

198 

extinction,  321 


Feigenbaum,  428 
Feigenbaum  number,  429 
field,  390 

electric,  130,  150,  241,  397 

electromagnetic,  9,  241,  397-398 

gravitational,  130,  151 
field  equations,  148-152,  397-398 
figure  of  eight,  362 
first-order  convergence,  426 
fish  population,  312,  341 
fixed  point,  425,  434,  440 
flow,  308 
fluid,  114,  146 

incompressible,  290,  392 

velocity  field  in,  391 
flux  of  particles,  90,  94,  169 
flywheel,  227 
focus,  86,  413,  419 

of  charged-particle  beam,  126 

of  conic,  413,  4H 
folding,  353,  433,  436,  440 
force,  6,  12 

addition  of,  6,  382 

apparent,  112,  235 

as  basic  quantity,  15 

central,  55-57,  181,  189,  198 

central  conservative,  8,  66,  73-98, 
279,  283-284,  357-359,  375 

centrifugal,  77,  112,  114,  235 

conservative,  17-18,  49-51, 
188-190,  394,  395 

Coriolis,  112,  114-120 

definition  of,  12 

dissipative,  9,  26 

electromagnetic,  9,  110,  235,  241 

electrostatic,  8,  14,  130 

external,  13,  177 

generalized,  64,  234 

gravitational,  8,  14,  130 

impulsive,  37,  40,  202,  226 

inverse  square  law,  78-84,  129,  358 

Lorentz,  110,  241 

magnetic,  108 

on  composite  body,  178 

periodic,  30-36,  265 

saw-tooth,  47 
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square-wave,  35 
step-function,  39 
three-body,  7 
tidal,  147,  184 
total,  6 

two-body,  6,  177 

work  done  by,  26,  49,  188,  234 
forced  oscillation,  30-39,  265-266 
forced  system,  232-233,  239,  280 
Foucault's  pendulum,  117 
four-cycle,  435 
Fourier  integral  theorem,  36 
Fourier  series,  34,  271 
fractal,  333,  373,  436 
frame,  4 

accelerated,  127,  145 

centre-of-mass,  162-164 

inertial,  3-4,  295 

laboratory,  165 

rotating,  105-125,  218,  239 

transformation  to  moving,  14, 
298-300 
free  motion  of  rigid  body,  219, 

223-225 

freely  falling  body,  115-117,  127 
frequency,  23 

angular,  25 

cyclotron,  109 

fundamental,  270 

Larmor,  121 

of  normal  mode,  350 
friction,  9 

Galaxy,  99 

Galilean  transformation,  295-301 

Galileo,  2,  10 

Galton  board,  339  ,  346 

gauge  transformation,  250,  397 

Gauss  law,  150 

Gauss'  theorem,  396,  399 

general  relativity,  1,  3,  144,  192 

generalized  co-ordinates,  231-233 

generalized  force,  64,  234 

for  angular  co-ordinate,  234 
generalized  momentum,  64,  65,  235, 

277 


in  electromagnetic  field,  242 
generating  function,  292 
geodesic,  61,  70 
gradient,  390,  398 

in  polar  coordinates,  398 
gradient  system,  317,  342 
gravitational  acceleration,  51,  99,  130 

apparent,  112-113,  142 

of  Earth,  xv 
gravitational  constant,  xv,  8,  15,  82 
gravitational  field,  130,  151 

uniform,  159-161,  183,  191 
gravitational  force,  8,  14,  130 
gravitational  mass,  10 
gravitational  potential,  129-131,  152 
Green's  function,  37 
gyroscope,  212,  214,  228,  229,  250 

hairy  ball  theorem,  349 
half-width  of  resonance,  33 
Hamilton's  equations,  277-280 
Hamilton's  principle,  63,  234 

for  stretched  string,  245 
Hamiltonian  function,  278 

and  total  energy,  280 

for  central  conservative  force,  279 

for  charged  particle,  302 

for  symmetric  top,  284 

rate  of  change  of,  280 
Hamiltonian  system,  321 
hard-sphere  collision,  90-94,  169,  172 
harmonic  oscillator,  20-26,  73-76, 

312-313  ,  341,  345 

action/angle  variables,  356-357 

anisotropic,  73 

complex  representation  of,  24-25 
damped,  27-29,  341 
double,  350,  375 
forced,  30-39 

isotropic,  51,  73-76,  100,  377 
orbit  of,  75 

with  varying  frequency,  369 
Hartman-Grobman  theorem,  422 
heat,  26 

heat  conduction,  332 
helium  nucleus,  96 
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helix,  109 

Henon  map,  434,  437,  442 
Henon-Heiles  system,  368-369 
hidden  symmetry,  368 
holonomic  system,  232,  235 
homoclinic  intersection,  440 
Hopf  bifurcation,  326-328,  343 
Huygens,  377 

hydrogen  atom,  xv,  83-84,  126 
hyperbola,  79,  82,  88,  410 

in  polar  form,  413 
hyperbolic  point,  418,  440 

ignorable  co-ordinate,  282-285 
impact  parameter,  79,  89 
improper  node,  418 
improper  transformations,  300 
impulse,  37 

velocity,  180 
impulsive  force,  37,  40,  202,  226 
incompressible  fluid,  290,  392 
inelastic  collision,  42 
inertia  tensor,  205,  401 
inertial  frame,  3-4 

transformation  of,  295,  298-300 
inertial  mass,  10 
infinite  precision,  339 
inner  product  of  vectors,  402 
insect,  226 
integrability,  347-351 
integrable  system,  294,  347,  353,  437, 

438 

integral  theorems,  393-396 
integration  of  vector,  390 
internal  force,  177 

central,  181,  189,  198 

conservative,  189 
invariant  circle,  440 
invariant  probability  distribution,  432 
invariant  torus,  349 
inverse  cube  law,  102 
inverse  square  law,  78-84,  358 

orbit,  85-89 
inversion  symmetry,  300 
involution,  348 
isobars,  119 


isolated  critical  point,  415 
isolated  system,  6,  178 
isotropic  harmonic  oscillator,  51, 

73-76,  100 
iteration,  425 

Jacobi  integral,  360 
Jacobian  matrix,  314,  415 
Jupiter,  98,  100,  125,  154,  193,  304 
effect  on  Sun,  174 

KAM,  373,  438 
Kepler's  first  law,  103 
Kepler's  second  law,  57,  103 
Kepler's  third  law,  87,  103,  163,  358, 
375 

Kermack-McKendrick  theorem,  344 

kinetic  energy,  17,  49 
and  Euler  angles,  223 
in  CM  frame,  188 
in  orthogonal  co-ordinates,  58,  254 
near  equilibrium,  253 
of  forced  system,  233,  239 
of  many-body  system,  188 
of  natural  system,  233,  280 
of  particles  on  string,  266 
of  rotating  body,  199,  207 
of  stretched  string,  244 
of  two-body  system,  161,  162,  166 
rate  of  change,  17,  26,  49,  188,  198 
transferred,  166 

Kronecker's  symbol,  35 

laboratory  frame,  165 

differential  cross-section  in,  169, 
171,  172 

Lagrange  multiplier,  72,  195,  240, 

249,  40 6 
Lagrange's  equations,  62-66,  128, 

231-248 

for  conservative  system,  235 
for  many-body  system,  190-192 
for  non-conservative  force,  234 
for  two-body  system,  161 
in  polar  co-ordinates,  64-66,  234 
Lagrangian  density,  245 
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Lagrangian  function,  62,  235 

for  charged  particle,  242 

for  coupled  oscillators,  265 

for  many-body  system,  191 

for  pendulum,  64,  238 

for  stretched  string,  245 

for  symmetric  top,  236 

for  two-body  system,  161 

in  uniform  gravitational  field,  191 
Lagrangian  points,  251,  303,  378 
lamina,  199 
Lanchester's  law,  323 
Laplace's  equation,  151,  275 
Laplace-Runge-Lenz  vector,  376 
Laplacian,  393 

in  polar  co-ordinates,  400 

of  vector  field,  393,  406 
Larmor  effect,  120-124,  126,  362 
Larmor  frequency,  121,  228,  303 
laws  of  mechanics,  1-12,  14 
laws  of  nature,  1 
Legendre  polynomial,  131 
libration,  316,  351,  354 
limit  cycle,  324-328,  375 
line  integral,  394 
linear  independence,  22 
linear  map,  426 
linear  system,  314,  415-421 
Liouville's  theorem,  289-291 
liquid  surface,  rotating,  114,  125 
Lissajous  figures,  351 
logistic  equation,  309-311,  426 
logistic  map,  426,  432,  44 1 

attractors,  428 

in  standard  form,  427 
Lorentz  force,  110,  241 
Lorentz  transformation,  301 
Lorenz,  331,  339 
Lorenz  attractor,  436 
Lorenz  system,  331-334,  344,  353, 

424,  433,  434 
Lotka-Volterra  system,  318-321,  342 
Lozi  map,  44% 
lunar  theory,  373 

Lyapunov  exponent,  338,  353,  432, 

442 


Macdonald,  342 
magnetic  bottle,  349 
magnetic  dipole,  124,  406 
magnetic  field,  241,  397 
of  Earth,  334 

particle  in,  108-109,  120-124 

uniform,  243 
magnitude  of  vector,  381 
malaria,  342 
Mandelbrot  set,  460 
many-body  system,  177-192 

angular  momentum  of,  181-183 

energy  conservation  in,  188-190 

Lagrange's  equation  for,  190-192 
maps,  307,  373,  425-441 

area-filling,  353 

area-preserving,  353 

Arnold's  cat,  443 

bounce,  364,  366 

cubic,  442 

discrete,  353 

equal-area,  437 

Henon,  434,  437,  442 

linear,  426 

logistic,  426,  432,  441 

Lozi,  442 

nonlinear,  426 

Poincare  return,  352,  353 

standard,  443 

tent,  442 

twist,  437,  438 
Maskelyne,  15 
mass,  6 

additive  nature  of,  12 

gravitational  and  inertial,  10 

measurement  of,  10 

of  composite  body,  12,  178 

reduced,  160 
matchbox,  220 

maximum  scattering  angle,  168 
Maxwell's  equations,  397 
mean  free  path,  94-96 
Mercury,  144,  186 
moment  of  force,  53 
moment  of  inertia,  199,  203 
calculation  of,  208-211 
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principal,  206 
shift  of  origin,  208 
momentum,  6 

as  generator  of  translation,  296 
conservation  of,  11,  39,  160,  178, 
296 

generalized,  64,  65,  235,  242,  277 

in  CM  frame,  162 

of  many-body  system,  178 

of  rigid  body,  197 

rate  of  change,  6,  178,  197 

monopole,  132,  137 

Moon,  xvi,  99,  163,  183-188 
precession  of  orbit,  157 
tides  due  to,  144-148,  154 

moving  frame,  14,  298-300 

multiple  collisions,  95 

multiply  periodic  trajectory,  350 

n-body  problem,  359-362,  374 
n-torus,  437 

natural  system,  232-233 

condition  for,  280 
nautical  mile,  102 
neap  tides,  146 

near-integrable  system,  372-374,  438 

nested  tori,  349,  437 

Newton,  2,  70 

Newton's  first  law,  4,  178 

Newton's  gravitational  constant,  xv, 

8,  15,  82 
Newton's  law  of  gravity,  8 
Newton's  laws,  5-10 
Newton's  second  law,  6,  12,  178 
Newton's  third  law,  7,  9,  11,  159,  178, 

406 

Newton-Raphson  iteration,  426 
node,  422 

improper,  418 

proper,  419 
nodes  on  stretched  string,  270 
non-holonomic  system,  232 
non-integrable  system,  347,  353 
non-invertibility,  433 
nonlinear  map,  426 
normal  co-ordinates,  264-266 


for  stretched  string,  274 
normal  modes,  258-271 
nonlinear,  350 

of  coupled  oscillators,  261-266 
of  double  pendulum,  260 
of  particles  on  string,  266-269 
of  stretched  string,  269-271 
nucleus,  96 

oblateness,  139 

of  Earth,  140,  141 

of  Jupiter,  154 
one-dimensional  maps,  425-433 
orbit,  84-89 

Bohr,  84 

circular,  81 

elliptical,  86-87,  101 

hyperbolic,  88-89 

in  phase  space,  308 

in  rotating  frame,  120 

inverse  cube  law,  102 

inverse  square  law,  85-89 

of  comet,  89,  99 

of  Earth,  88,  100 

of  Jupiter,  98 

of  oscillator,  75 

of  satellite,  98,  143,  156 

precession  of,  103,  120,  143 

transfer,  100,  193 
orientation  of  rigid  body,  221 
orthogonal  co-ordinates,  58,  253-256, 

398 

orthogonal  tensor,  406 
orthogonality,  348 

of  eigenvectors,  403 
orthonormal  triad,  383,  404 
oscillation,  forced,  30-39,  265-266 
oscillator,  20-26,  73-76,  312-313, 

341,  345 

action/angle  variables,  356-357 
complex  representation  of,  24-25 
damped,  27-29,  274 
forced,  30-39,  4 6 
isotropic,  51,  73-76,  100,  377 
with  varying  frequency,  369 
oscillators,  coupled,  261-266 
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outer  product  of  vectors,  402 
oval  billiard,  366,  441 

parabola,  52,  82,  90,  126,  411 

in  polar  form,  413 
parabolic  co-ordinates,  71 
paraboloid,  114 
parallax,  174 

parallel  axes  theorem,  209 
parallelepiped,  387 

moments  of  inertia,  210 
parallelogram  law,  6,  382 
parity,  300 
particle,  5 

path,  shortest,  59,  61,  70 
pendulum,  18-19,  43,  45,  46,  64,  226, 

227,  316-317,  341 

compound,  200-202 

constrained,  238-241,  302 

coupled,  261-266 

double,  255-257,  260,  369 

equivalent  simple,  201,  226 

forced  damped,  336 

Foucault's,  117 

hanging  from  trolley,  249,  302 

of  varying  length,  370,  379 

period  of,  48 
perihelion,  101 
period 

of  orbit,  xv,  74,  87,  163 

of  oscillation,  23,  200 

of  rotation,  105 
period-doubling,  375 
period-doubling  bifurcation,  428 
period-doubling  cascade,  428,  435 
periodic  force,  30-36,  265 
periodic  trajectory,  350 
Perron-Frobenius  equation,  432 
perturbed  twist  map,  437 
phase  line,  309-312 
phase  of  forced  oscillation,  31,  33 

at  resonance,  34 
phase  plane,  312-318,  415-424 
phase  portraits,  307-309 

local  or  global,  424 
phase  space,  289,  307-309 


phase  velocity,  308 
pion,  175 

planet,  44,  101,  103,  195 

plumb  line,  15,  112,  143 

Pluto-Charon  system,  163 

Poincare,  307,  339 

Poincare  return  map,  352,  353 

Poincare  section,  364,  366,  437,  441, 

443 

Poincare-Birkhoff  theorem,  440 
point  particle,  5 
Poisson  bracket,  293  ,  304  ,  348 
Poisson's  equation,  151,  397 
polar  co-ordinates,  57-59,  398-400 

acceleration  in,  71 

Lagrange's  equation  in,  64-66 

unit  vectors  for,  71 

velocity  in,  56 
polar  vector,  55,  386 
position  vector,  5,  6,  381 

of  composite  body,  5,  178 

relative,  7,  159,  162,  181,  182,  299 
potential,  129-152 

electrostatic,  129-131,  151 

expansion  at  large  distances,  137 

gravitational,  129-131,  152 

of  centrifugal  force,  141 

of  dipole,  132 

of  Earth,  140 

of  quadrupole,  132 

of  spherical  distribution,  134 

of  spheroid,  139 

scalar  and  vector,  241,  397 

tidal,  144 
potential  energy,  18,  50,  394 

effective,  77,  78,  80,  235,  239,  283, 
285,  288 

external,  189,  191,  198 

in  many-body  system,  189 

internal,  161,  189,  191 

inverse  square  law,  78 

near  equilibrium,  20,  256 

of  central  force,  66 

of  centrifugal  force,  114,  141,  239 

of  harmonic  oscillator,  20,  50 

of  impulsive  force,  40 
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of  particles  on  string,  267 

of  simple  pendulum,  19 

of  stretched  string,  244 

rate  of  change,  50 
precession,  103,  108,  211-215,  330 

Larmor,  120,  362 

of  Earth's  axis,  224,  330 

of  equinoxes,  214-215,  229 

of  orbit,  143,  156,  157 

of  rolling  wheel,  228 

of  symmetric  top,  213,  228, 
236-237,  249 

planetary,  144 
predictability,  337-339 

loss  of,  339,  374 
pressure,  in  fluid,  155 
prey-predator  system,  318-321 
Prigogine  and  Lefever,  327 
principal  axes  of  inertia,  205-211 

rotation  about,  218-220 
principal  moment  of  inertia,  206,  329 
Principia,  2 

probability  distribution  attractors, 
432 

product  of  inertia,  203 

shift  of  origin,  208 
projectile,  51-53,  68,  82,  127 

range  of,  52,  68,  102 

trajectory  of,  52,  53,  68 
projection  of  vector,  384 
prolate  spheroid,  139 
proper  node,  419 
proton,  xv,  14,  167 
pseudoscalar,  300 
pulley,  248 

quadrupole,  132 

quadrupole  moment,  133,  139 

of  Earth,  140,  154,  155 
quadrupole  tensor,  407 
quality  factor,  28,  33,  47 
quantum  mechanics,  2,  9,  20,  67,  83, 

301 

quasiperiodic  trajectory,  350 
radial  energy  equation,  77,  280 


for  inverse  square  law,  78 
radiation  gauge,  397 
range  of  projectile,  52,  68,  102 
range  of  tides,  147 
rate  constant,  426 
rate  of  particle  detection,  92,  171 
rational  approximation,  373 
Rayleigh's  equation,  324-326,  343 
reaction  on  axis,  200-203,  218,  226 
rectangular  plate 

moments  of  inertia,  210 
reduced  mass,  160 
reflection  symmetry,  205,  300 
relative  position,  7,  159,  162,  181, 

182,  299 
relative  velocity,  7,  299 

in  elastic  collision,  41 
relativity,  3 

general,  1,  3,  144,  192 

principle  of,  3-4,  10,  26,  198, 
295-301 

special,  2,  10,  109,  301 
relaxation  oscillation,  325 
relaxation  time,  28 
renormalization,  429 
repeller,  310 

resonance,  32-34,  148,  187,  373 

half-width  of,  33 
Richardson,  324 
right-hand  rule,  105,  385 
rigid  body,  197-225 

and  Euler  angles,  221 

angular  momentum  of,  197,  199, 
203-208,  218 

angular  velocity  of,  216-218 

energy  of,  198 

free  motion  of,  223-225,  228 
generalized  co-ordinates  for,  231 
kinetic  energy  of,  199,  207,  233 
precession  of,  211-215 
symmetric,  207-208,  222 

rigid-body  rotation,  329-330  ,  343 

Rikitake  dynamo,  334-336,  345 

Roche  limit,  155,  186 

rocket,  102,  179-181,  192 

Ross,  342 


476 


Classical  Mechanics 


rotating  frame,  4,  105-125,  218,  239 

acceleration  in,  111-112 

rate  of  change  of  vector  in,  106-108 
rotation,  317 

rotation  about  an  axis,  198-202 

stability  of,  218-220 
rotation  matrix,  417 
rotation  number,  437,  438 
rotation  of  axes,  383 
rotational  symmetry,  205 

and  angular  momentum 

conservation,  291,  292, 
296-297 
Routh's  rule,  209-211 
runoff  problem,  318 
Rutherford  scattering,  96-97 

saddle,  418 
Sarkovskii,  430 
satellite,  13,  163 

orbit  of,  143,  156,  194 

synchronous,  98,  192 
scalar,  5,  300,  381 
scalar  field,  390 
scalar  potential,  241,  397 
scalar  product,  384 
scalar  triple  product,  387 
scattering,  90-94,  96-97,  165-173 

by  multiple  targets,  95 

Coulomb,  79-80 

cross-section,  90-97,  168-173 

hard-sphere,  90-94,  172 

Rutherford,  96-97 
scattering  angle,  89,  91 

maximum,  168 
Schiehallion,  15 
second-order  convergence,  426 
semi-axes,  75,  87,  88,  410 
semi-latus  rectum,  86 
semi-major  axis,  76,  87,  410 

of  Earth's  orbit,  xv,  98 
semi-minor  axis,  89,  410 
sensitivity  to  initial  conditions, 

337-339,  353,  432,  440 
separatrix,  317,  341,  372 
sidereal  day,  105 


sigmoid  curve,  310 

similarity  transformation,  417 

SIR  model,  343 

slingshot,  101 

Smale  horseshoe,  436,  440 

small  denominators,  373 

small  oscillations,  253-271 

of  particles  on  string,  266-269 

of  pendulum,  200 

of  rotating  body,  220 

of  stretched  string,  244-248, 
269-271 
Solar  System,  360,  374 
solid  angle,  93 
source  or  sink,  419 
space,  2-4 
space  cone,  224 
space  translation,  296 
spacecraft,  100,  180,  193,  227,  251 
sphere 

moments  of  inertia,  210,  227 

potential  of,  135 

rolling  on  plane,  232 
spherical  charge  distribution,  134-136 
spherical  polars,  57-59,  65,  71, 

398-400 

kinetic  energy  in,  58 

Lagrange's  equations  in,  65 

volume  element  in,  134 
spherical  shell,  134 
spheroid,  139,  207,  228 
spiral  point,  419,  422 
spiral  source  or  sink,  419 
spring,  69 
spring  tides,  146 
square  of  vector,  384 
squeezing,  353,  436,  440 
stability 

and  Laplace's  equation,  275 

asymptotic,  310 

of  equilibrium,  22,  259 

of  Lagrangian  points,  251,  303 

of  rotation,  218-220,  239 

of  vertical  top,  288-289 

structural,  309 
stable  manifold,  315 
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theorem,  423 
stable  point,  314 
standard  map,  443 
standing  wave,  270 
star,  419 

steady  convection,  333 

steradian,  93 

Stokes'  theorem,  394 

strange  attractor,  334,  338,  436 

stress  tensor,  407 

stretched  string,  244-248 

normal  modes  of,  269-271 

particles  on,  266-269 
stretching,  353,  433,  436,  440 
structural  stability,  309 
subcritical  bifurcation,  327 
subtraction  of  vectors,  382 
Sun,  xvi,  98,  125,  304 

angular  position  of,  174 

effect  on  Earth's  rotation,  215 

potential  of,  136 

tides  due  to,  144-148,  187 
supercritical  bifurcation,  327,  343 
supercritical  flip  bifurcation,  428 
superposition  principle,  21,  259 
surface  of  rotating  liquid,  114,  141 
surface  of  section,  351-353 
surface,  closed,  148,  396 
symmetric  rigid  body,  207-208,  222 

angular  momentum  of,  207,  222 

free  motion  of,  223-225 

kinetic  energy  of,  223,  233 
symmetric  tensor,  205,  401 
symmetric  top,  236-237,  284-289,  363 

conservation  laws  for,  284,  291 

free  to  slide,  250,  302 

Hamiltonian  for,  284 

Lagrangian  for,  236 

precession  of,  213,  228,  236-237 

vertical,  288-289 
symmetries  and  conservation  laws, 

291-301,  347 
synchronous  satellite,  98,  192 

Tacoma  Narrows  bridge,  326 
target  particle,  165 


tautochrone,  377 

tennis  racquet  theorem,  220,  329 

tension  in  string,  244 

tensor,  400-405,  415,  424 

inertia,  205,  401 

symmetric,  205,  401,  403-404 
tensor  product  of  vectors,  402 
tent  map,  44% 
terminal  speed,  45 
three-body  problem,  359-362 

restricted,  251,  303 
three-body  system,  12,  14 
tides,  144-148,  154,  184,  195 

atmospheric,  187 

due  to  Earth,  154,  155 

due  to  oceans,  155 

spring  and  neap,  146 
time,  2-4 

time  translation,  295 

top,  212,  236-237,  284-289,  363 

torus,  349,  352,  373 

torus  breakdown,  437 

total  cross-section,  94 
and  mean  free  path,  94 
for  hard-sphere  scattering,  94 
for  Rutherford  scattering,  97 

trace,  406,  416 

trade  winds,  119 

trajectory 
closed,  350 
in  phase  space,  308 
of  projectile,  52,  53,  68 

transient,  32 

translation  symmetry,  295-296 

transposed  tensor,  401 

triple  product,  387-388 

Trojan  asteroids,  304 

true  anomaly,  101 

true  centre,  422,  424 

turbulent  convection,  333 

twist  map,  437,  438 

two-body  collision,  11,  14,  39-42, 

165-173 
two-body  system,  159-173 

Lagrangian  for,  161 
two-cycle,  434,  435 
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two-dimensional  maps,  433-437 

unaccelerated  observer,  3 
uncertainty  principle,  301 
uniform  gravitational  field,  159-161, 
183 

uniform  magnetic  field,  108-109,  243 
unit  vector,  5,  382 
units,  xvi 

universal  function,  429 
unstable  manifold,  315 
unstable  point,  310,  314,  418,  419 

valence,  of  tensor,  400 
Van  der  Pol  equation,  326,  343 
variation,  of  integral,  61,  233 
vector,  4-5,  381 

angular  velocity,  105 

axial,  55,  105,  386 

components  of,  382 

differentiation  of,  388 

integration  of,  390 

polar,  55,  386 

position,  5,  381 

unit,  5,  382 

zero,  382 
vector  field,  390 

Laplacian  of,  393,  406 

line  integral  of,  394 
vector  potential,  241,  397 
vector  product,  385 


vector  triple  product,  388 
velocity,  6 

circular  orbital,  81 

escape,  82-83,  99 

in  rotating  body,  106-108 

in  rotating  frame,  111 

of  centre  of  mass,  160,  178,  299 

of  light,  xv 

of  propagation,  246 

polar  components  of,  56,  58 

relative,  7,  40,  299 
velocity  impulse,  180 
Verhulst,  309 
vertical  top,  288-289 
violin  string,  324 
virial  theorem,  101,  196 
Volterra's  principle,  321 
vorticity,  392 

war,  322-324 
wave  equation,  246 
wave,  electromagnetic,  9 
wavelength,  of  normal  mode  of  string, 
270 

width,  of  resonance,  32 

work  done  by  force,  26,  49,  188,  234 

Zeeman  effect,  121 
zero  vector,  382 

zero-velocity  curve,  360,  367,  378 


